Making large AI models cheaper, faster and more accessible
[Inference] Remove unnecessary float4_ and rename float8_ to float8 (#5679)
Steve Luo committed
725fbd2ed067f9c58ac04670377d3e6f2a96fe00
Parent: 537a3cb
Committed by GitHub <noreply@github.com>
on 5/6/2024, 2:55:34 AM