Skip to content

hubojing/LLM

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

21 Commits
 
 

Repository files navigation

LLM

大模型自用资料。

前沿解读

GGUF模型下载

论文 & 模型 & 技术报告

deepseek

基座模型

代码模型

数学推理

  • DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models https://arxiv.org/abs/2402.03300
  • DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning

定理证明

  • DeepSeek-Prover-V2: Advancing Formal Mathematical Reasoning via RL for Subgoal Decomposition
  • DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search https://arxiv.org/abs/2408.08152
  • DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data https://arxiv.org/abs/2405.14333

多模态

  • DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding https://arxiv.org/abs/2412.10302
  • DeepSeek-VL: Towards Real-World Vision-Language Understanding

其它

Qwen

Gemini

Kimi

GLM

o1

其它

  • ChatGLM2 CODE
  • LLaMA: Open and Efficient Foundation Language Models PDF CODE
  • ChatGLM CODE
  • PaLM: Scaling Language Modeling with Pathways PDF
  • InstructGPT PDF
  • GPT 3.0 PDF
  • T5: Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer PDF CODE
  • GPT 2.0 PDF
  • GPT 1.0 PDF
  • BERT PDF
  • Transformers PDF

Transformer原理

工具

书籍

评测

About

大模型相关的自用资料。

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors