Advertisement
Advertisement
β‘ Community Insights
Discussion Sentiment
50% Positive
Analyzed from 95 words in the discussion.
Trending Topics
#deepseek#context#model#flash#quality#openai#https#models#talking#degradation
Discussion Sentiment
Analyzed from 95 words in the discussion.
Trending Topics
Discussion (4 Comments)Read Original on HackerNews
https://www.reuters.com/world/china/openai-accuses-deepseek-...
From HF: 284B parameters (13B active), 1M context window.
This is indeed some kind of compressed context and the quality goes down as the context grows. IIRC the V4 paper had some numbers on this
https://huggingface.co/deepseek-ai/DeepSeek-V4-Flash