Discussion about this post

User's avatar
Jerry's avatar

Also worth noting, since I've been learning a lot about how transformers and LLMs work the past few days and am seeing everything in relation to that (https://www.reddit.com/r/PhilosophyMemes/s/zsZjEkcg2B): at the end of a run, the transformer gets a probability distribution over all possible tokens of which comes next. Using a "greedy" selection algorithm, where it always picks the absolute highest most likely word to go next, actually doesn't give as good results in practice as using some other algorithm to pick from among the top choices. The "temperature" parameter modulates how far from the highest probability token it will go

Jerry's avatar

https://forum.effectivealtruism.org/posts/B6d8Wzk4gNzHsXvdi/ai-safety-is-extremely-bottlenecked-on-grantmakers

"hiring one fewer grantmaker usually means those millions will just sit in an account for another year rather than being deployed to useful ends. And when a strong candidate turns down a CG offer, the result is often not “a slightly-less-good grantmaker," it’s just one fewer grantmaker. We routinely close rounds with fewer hires than we'd planned for."

Just an example of this dynamic, imo. See also the comments on that thread.

4 more comments...

No posts

Ready for more?