Maybe I missed something or just forgot, but I remember right this is the first time I've seen data from @internetarchive mentioned in court docs filed for the case. A lot of attention has been on Libgen, but downloading Internet Archive for Llama-training seems noteworthy.
Internet Archive Data Used for Llama Model Training Disclosed
By
–
Leave a Reply