repositories Search Results · topic:inference org:microsoft
Filter by
0 results
(186 ms)0 results
inmicrosoft (press backspace or delete to remove)DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
- Python
- 33.8k
- Updated 3 hours ago
MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.
- Python
- 1.8k
- Updated yesterday
A DNN inference latency prediction toolkit for accurately modeling and predicting the latency on diverse edge devices.
- Python
- 327
- Updated 23 days ago
Dynamic batching library for Deep Learning inference. Tutorials for LLM, GPT scenarios.
- Python
- 72
- Updated on May 17, 2023
A platform that enables users to perform private benchmarking of machine learning models. The platform facilitates the evaluation of mode…
- Python
- 3
- Updated 11 days ago
Sponsor open source projects you depend on
Contributors are working behind the scenes to make open source better for everyone—give them the help and recognition they deserve.Explore sponsorable projectsProTip!
Press the /
key to activate the search input again and adjust your query.Sponsor open source projects you depend on
Contributors are working behind the scenes to make open source better for everyone—give them the help and recognition they deserve.Explore sponsorable projectsProTip!
Press the /
key to activate the search input again and adjust your query.