Making large AI models cheaper, faster and more accessible
[Feat] Inference RPC Server Support (#5705)
* rpc support source
* kv cache logical/physical disaggregation
* sampler refactor
* colossalai launch built in
* Unit test
* Rpyc support
---------
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Runyu Lu committed
18d67d0e8e79c22bded0745c7d3daf8ca40d445c
Parent: de4bf3d
Committed by GitHub <noreply@github.com>
on 5/14/2024, 2:00:55 AM