Repository search results · topic:inference org:microsoft

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
  • Python
  • 33.8k
  • Updated 3 hours ago

AICI: Prompts as (Wasm) Programs
  • Rust
  • 1.9k
  • Updated 15 hours ago

MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.
  • Python
  • 1.8k
  • Updated yesterday

A DNN inference latency prediction toolkit for accurately modeling and predicting the latency on diverse edge devices.
  • Python
  • 327
  • Updated 23 days ago

A large-scale simulation framework for LLM inference
  • Python
  • 130
  • Updated 11 days ago

Dynamic batching library for Deep Learning inference. Tutorials for LLM, GPT scenarios.
  • Python
  • 72
  • Updated on May 17, 2023

A platform that enables users to perform private benchmarking of machine learning models. The platform facilitates the evaluation of mode…
  • Python
  • 3
  • Updated 11 days ago