Discussion about this post

User's avatar
James's avatar

Wow this is pretty vindicating for those who believe in the scaling of dense models over the goodhart's law following MoE sparse models. I do think quantifying this would be a good idea even though you have distaste for such though. A shame we can't see 4.5 unfortunately... Do you think you could try with o3, o1 and gpt5? A quick heads up, going into Settings -> Data Controls -> Sharing -> You should find a feature, "Share inputs and outputs with OpenAI", enabling that gives you 1 million free credits per day for the best models, although OpenAI might go and train on that, thus defeating your goal! haha!

Expand full comment
Gunnar Tausch's avatar

Interesting. I wonder what would happen if you try this in polar coordinates and display the result on a sphere. Are the results reproducible? My guess would be: no generalisation beyond longitude/ latitude training data.

Expand full comment
21 more comments...

No posts