HTTP Range Requests Optimize Large Model Weight Loading
By
–
Global AI News Aggregator
I had the same question. It looks to me like it's working around that limit by fetching the model weights in 512 MB slices using HTTP range requests.
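For anyone curious what that could look like, here is a minimal Python sketch of slice-by-slice fetching with the standard `Range` request header. This is a guess at the general technique, not the project's actual code: the helper names (`byte_ranges`, `range_header`, `http_fetch`, `fetch_in_slices`) are made up for illustration, it assumes "512MB" means 512 MiB, and it assumes the total file size is already known (for example from a `Content-Length` header) and that the server answers range requests with `206 Partial Content`.

```python
import urllib.request
from typing import Callable, Iterator, Tuple

# Assumption: "512MB" is read as 512 MiB per request.
CHUNK_SIZE = 512 * 1024 * 1024


def byte_ranges(total_size: int, chunk_size: int = CHUNK_SIZE) -> Iterator[Tuple[int, int]]:
    """Yield inclusive (start, end) byte offsets that together cover total_size bytes."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    for start in range(0, total_size, chunk_size):
        # HTTP byte ranges are inclusive, so the last byte is start + size - 1.
        yield start, min(start + chunk_size, total_size) - 1


def range_header(start: int, end: int) -> str:
    """Format the value of an HTTP Range header for one inclusive byte range."""
    return f"bytes={start}-{end}"


def http_fetch(url: str, start: int, end: int) -> bytes:
    """Fetch one slice over HTTP (hypothetical helper; requires server range support)."""
    request = urllib.request.Request(url, headers={"Range": range_header(start, end)})
    with urllib.request.urlopen(request) as response:
        # A server that ignores Range replies 200 with the whole file instead.
        if response.status != 206:
            raise RuntimeError(f"expected 206 Partial Content, got {response.status}")
        return response.read()


def fetch_in_slices(
    url: str,
    total_size: int,
    chunk_size: int = CHUNK_SIZE,
    fetch: Callable[[str, int, int], bytes] = http_fetch,
) -> Iterator[bytes]:
    """Yield the file's contents one slice at a time, in order."""
    for start, end in byte_ranges(total_size, chunk_size):
        yield fetch(url, start, end)
```

Yielding slices one at a time means no single request or buffer has to hold more than one slice, which is presumably the point of the workaround. The `fetch` parameter is only there so the slicing logic can be exercised without a network connection.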