SIGN IN SIGN UP

EXPLORE

dhruvhead/llm-d-inference-scheduler MIRROR
aigateway-apiinferencekubernetesnetworking
Go 0 0
dhruvhead/sparkrun MIRROR
dgx-sparkinferencellama-cppsglangvllm
Python 0 0 1
dhruvhead/CTranslate2 MIRROR
avxavx2cppcudadeep-learning
C++ 0 0 32
dhruvhead/huggingface.js MIRROR
api-clienthubhuggingfaceinferencemachine-learning
TypeScript 0 0 59
dhruvhead/vision.cpp MIRROR
computer-visioncppinferencemachine-learningvulkan
C++ 0 0 55
dhruvhead/text-generation-inference MIRROR
bloomdeep-learningfalcongptinference
Python 0 0 95
dhruvhead/ddtree-mlx MIRROR
apple-siliconinferencellmmlxspeculative-decoding
Python 0 0 27
dhruvhead/SwiftLM MIRROR
apple-siliinferenceiosllmmetal
Swift 0 0 72
dhruvhead/speculative-decoding MIRROR
inferencellmllm-inferencemlxmlx-swift
Swift 0 0 31
dhruvhead/Rapid-MLX MIRROR
apple-siliconfastapihacktoberfestinferencellm
Python 0 0 96
dhruvhead/WhisperKit MIRROR
diarizationinferenceiosmacospyannote
Swift 0 0 44
dhruvhead/faster-whisper MIRROR
deep-learninginferenceopenaiquantizationspeech-recognition
Python 0 0 12
dhruvhead/whisper.cpp MIRROR
inferenceopenaispeech-recognitionspeech-to-texttransformer
C++ 0 0 38
dhruvhead/ort MIRROR
aiai-trainingfine-tuninginferencemachine-learning
Rust 0 0 49
dhruvhead/gpustack MIRROR
ascendcudadeepseekdistributed-inferencegenai
Python 0 0 60
sgl-project/sglang

SGLang is a high-performance serving framework for large language models and multimodal models.

attentionblackwellcudadeepseekdiffusion
Python 0 0 61
dhruvhead/vllm-ascend MIRROR
ascendinferencellmllmopsllm-serving
C++ 0 0 91
ggml-org/whisper.cpp MIRROR

Port of OpenAI's Whisper model in C/C++

inferenceopenaispeech-recognitionspeech-to-texttransformer
C++ 0 0 45
hpcaitech/ColossalAI MIRROR

Making large AI models cheaper, faster and more accessible

aibig-modeldata-parallelismdeep-learningdistributed-computing
Python 0 0 38
vllm-project/vllm MIRROR

A high-throughput and memory-efficient inference and serving engine for LLMs

amdblackwellcudadeepseekdeepseek-v3
Python 0 0 70
PAGE 1 NEXT →