Deepseek News & Updates
Your central hub for AI news and updates on Deepseek. We're tracking the latest articles, discussions, tools, and videos from the last 7 days.
All (6)
0 news
5 posts
1 tools
0 videos
17
Jan
16
Jan
15
Jan
14
Jan
13
Jan
12
Jan
11
Jan
No news articles found
Check back soon or explore other content types
No videos found
Check back soon for video content
17
Jan
16
Jan
15
Jan
14
Jan
13
Jan
12
Jan
11
Jan
Community talk
DeepSeek set to launch next-gen V4 model with strong Coding ability, Outperforms existing models
I reproduced DeepSeek's mHC at 1.7B params (8xH100). The instability is 3x worse than reported (10k vs 3k), but the model didn't explode.
[R] (DeepSeek) Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models
DeepSeek introduces Engram: Memory lookup module for LLMs that will power next-gen models (like V4)
[R] Why doubly stochastic matrix idea (using Sinkhorn-Knopp algorithm) only made popular in the DeepSeek's mHC paper, but not in earlier RNN papers?