Quantization
https://paperswithcode.com/task/quantization
Vector Quantization
- https://ieeexplore.ieee.org/document/1162229
- GPTVQ
- Example: https://scikit-learn.org/stable/auto_examples/cluster/plot_face_compress.html
& Maarten Greentendorst
- Quantization FP16 FP8, and INT8
Apple, WWDC
- https://developer.apple.com/
videos/play/wwdc2023/10047/ - https://apple.github.io/
coremltools/docs-guides/ source/opt-overview.html ( website)
No comments:
Post a Comment