Nice validation here that my short tutorial on building and publishing a Python library to PyPI works as intended!
@simonw
-
Build Ship and Ship Guilt-Free Moving Forward
By
–
Get it to a point where it does something useful, then put it online and write about what it is and how you built it Once I've done that I find I don't feel guilty moving on to working on something else
-
Experience versus intelligence: reframing expertise assessment
By
–
I often find myself mentally swapping "smartest" for "most experienced", since "experience" helps imply that it's for a specific field, not everything
-
Using Large Language Models: A Year-Long Series Guide
By
–
A year later here's my series of posts about this https://
simonwillison.net/series/using-l
lms/
… -
Tokenizer Training Data: Understanding Token Presence Significance
By
–
If uberinternal WAS a token that tells you that it was one of the top ~30,000 character sequences present in the text they used to build the tokenizer – which is a different corpus from the training set used to train the model uberinternal not being a token doesn't tell you much
-
From Apple Notes to GitHub Issues: Capturing and Implementing Ideas
By
–
I use Apple notes for this about a dozen times a day, but I try to turn the more interesting notes into GitHub issues threads if I want to actually implement the ideas that I captured
-
LLM Gullibility Undermines Agent System Viability
By
–
LLMs are inherently gullible. Until we solve that a lot of these "agent" dreams aren't actually going to work very well in practice!
-
Jina AI Open Source Repository Availability Question
By
–
The FAQ says the code is open source, but I couldn't find the repo under https://
github.com/jina-ai – is the repo not public yet? -
Legal compliance versus human trust in technology
By
–
Legally fine isn't the same thing as not feeling betrayed
-
NYT lawsuit fair use argument against AI training
By
–
The first four pages of the NYT lawsuit are worth reading – the legal argument rests on "fair use", and one of the factors in whether or not something is fair use is how much it competes with the content it has duplicated https://
nytco-assets.nytimes.com/2023/12/NYT_Co
mplaint_Dec2023.pdf
…