Making large AI models cheaper, faster and more accessible
[npu] use extension for op builder (#5172)
* update extension * update cpu adam * update is * add doc for cpu adam * update kernel * update commit * update flash * update memory efficient * update flash attn * update flash attention loader * update api * fix * update doc * update example time limit * reverse change * fix doc * remove useless kernel * fix * not use warning * update * update
X
Xuanlei Zhao committed
dd2c28a32352de45675ab13e72049a7f2a57e364
Parent: d6df19b
Committed by GitHub <noreply@github.com>
on 1/8/2024, 3:39:16 AM