Discussion about this post

User's avatar
Sander Land's avatar

Nice work! Have you seen our paper in this area? It may be of interest for looking at embeddings, quite simple things are pretty effective: https://arxiv.org/abs/2405.05417

I haven't run deepseek v3 myself as it's so big, but v2 is here: https://github.com/cohere-ai/magikarp/blob/main/results/reports_mini/deepseek_ai_DeepSeek_V2_Lite.md

Expand full comment
Mike's avatar

it is pretty hard to make it write "Nameeee" exactly as it is, but somehow it was able to do that after 5 messages in chat with r1. Also funny to see model trying to justify itself

Expand full comment
6 more comments...

No posts