Yeah and it tries half context if nothing fits at full. Saves you from downloading models that won't work
Model optimization: context reduction saves download resources
By
–
By
–
Yeah and it tries half context if nothing fits at full. Saves you from downloading models that won't work