ML Performance Reading Group Session 7 covered the DeepSeek V3 paper. We also discussed parts of the DeepSeek V2 paper for comparison.
Presenter: Daniel Vega-Myhre
Papers:
1. DeepSeek V3 (https://arxiv.org/abs/2412.19437)
2. DeepSeek V2 (https://arxiv.org/pdf/2405.04434)