DeepSeek LLM: Scaling Open-Source Language Models with Longtermism
21 Jan
This paper discusses the development of DeepSeek LLMs, focusing on scaling laws, pre-training methodologies, and evaluation results across various benchmarks in both English and Chinese. It highlights the importance of data quality in model performance and outlines future directions for enhancing the capabilities of open-source language models.