PenghuiCheng for integrating weight-only quantization to accelerate transformer-based models on @Intel platforms
https://
python.langchain.com/docs/integrati
ons/llms/weight_only_quantization/
… And JamsheedMistri for adding the [
@layerup_
](
https://
x.com/layerup_) security wrapper for LLMs
https://
python.langchain.com/docs/integrati
ons/llms/layerup_security/
… (5/n)
Weight-Only Quantization and Security Wrapper Integration for LLMs
By
–
Leave a Reply