The Tsinghua large-model team brings you the most efficient large-model learning resources and solutions
DeepSeek Open Source Week
【Day1】 FlashMLA - let's geek out
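【Day2】 DeepEP - the first open-source EP communication library for MoE model training and inference
【Day3】 DeepGEMM - general matrix multiplication, kept simple
【Day4】 Parallelism strategy optimization - taking parallelism all the way
【Day5】 Fire-Flyer File System - putting data processing on the high-speed rail
【Day6】 How DeepSeek achieves a 545% profit margin
DeepSeek In-Depth Tutorials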
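DeepSeek: From Beginner to Expert
DeepSeek Guide Handbook
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
DeepSeek-V3 Technical Report
DeepSeek-VL2 Technical Report
DeepSeek In-Depth Tutorials
DeepSeek: From Beginner to Expert
DeepSeek Local Deployment
DeepSeek Practical Tips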
Liu Zhiyuan's Team: Large Model Open Course
Hung-yi Lee's Machine Learning Course Series
Mu Li's "Dive into Deep Learning"
Prompt-Engineering-Guide
openai-cookbook
anthropic-cookbook
generative-ai-for-beginners
promptflow
Awesome-Prompt-Engineering
LangGPT
SuperPrompt
promptfoo
Learning-Prompt
code2prompt
tree-of-thoughts
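Learn_Prompting
Classic Books
Large-Scale Language Models: From Theory to Practice (大规模语言模型:从理论到实践)
Large Language Models (大语言模型)
Hands-On AI Agents (动手做AI Agent)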
Generative AI Handbook: A Roadmap for Learning Resources
Understanding Deep Learning
Taming LLMs: A Practical Guide to LLM Pitfalls with Open Source Software
Natural Language Processing: Large Model Theory and Practice (自然语言处理:大模型理论与实践)
Hugging Face Course
Google Machine Learning Crash Course
Illustrated book to learn about Transformers & LLMs
Building LLMs for Production: Enhancing LLM Abilities and Reliability with Prompting, Fine-Tuning, and RAG
Introductory LLM Tutorial for Developers (面向开发者的LLM入门教程)
Foundations of Large Language Models
Dive into Deep Learning (动手学深度学习)
Dive into LLMs (动手学大模型)
Build a Large Language Model (From Scratch)
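Multimodal Large Models (多模态大模型)
A Practical Guide to Large Language Models: Application Practice and Real-World Deployment (大型语言模型实战指南:应用实践与场景落地)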
Hands-On Large Language Models
Hands-On Reinforcement Learning (动手学强化学习)
Foundations of Large Models (大模型基础)
CS324: Large Language Models
CS229: Machine Learning
CS230: Deep Learning
CS231n: CNN for Visual Recognition
CS224n: NLP with Deep Learning
CS224w: Machine Learning with Graphs
CS224u: Natural Language Understanding
CS234: Reinforcement Learning
CS330: Deep Multi-task Learning
CS25: Transformers United
Stanford ML Explainability
Stanford NLP
CMU CS 11-711: Advanced NLP
CMU CS 11-747: Neural Networks for NLP
CMU CS 11-737: Multilingual NLP
CMU CS 11-785: Deep Learning
CMU CS 11-777: Multimodal ML
CMU CS 10-708: Probabilistic Graphical Models
CMU LTI Low Resource NLP
MIT OpenCourseWare
MIT 6.034: Artificial Intelligence
MIT 6.S094: Deep Learning
MIT 6.S191: Introduction to Deep Learning
MIT 6.S192: Deep Learning for Art
CS221: Artificial Intelligence
MIT 6.5940: TinyML
DeepSeek-R1
DeepSeek-V3
DeepSeek-VL2
Attention Is All You Need
BERT
GPT-3
PaLM
InstructGPT
Constitutional AI
LLaMA
GPT-4
PaLM 2
RWKV
Llama 2
Code Llama
Mistral 7B
Phi-2
Mixtral 8x7B
Stable LM 3B
arXiv LLM Papers
The First Law of Complexodynamics
Recurrent Neural Network Regularization
Keeping Neural Networks Simple
Pointer Networks
Order Matters: Sequence to Sequence for Sets
GPipe: Easy Scaling with Micro-Batch Pipeline Parallelism
Deep Residual Learning for Image Recognition
Multi-Scale Context Aggregation by Dilated Convolutions
Neural Message Passing for Quantum Chemistry
Neural Machine Translation by Jointly Learning to Align and Translate
Identity Mappings in Deep Residual Networks
A Simple Neural Network Module for Relational Reasoning
Variational Lossy Autoencoder
Relational Recurrent Neural Networks
Neural Turing Machines
Deep Speech 2
Scaling Laws for Neural Language Models
A Tutorial on the MDL Principle
Machine Super Intelligence
Kolmogorov Complexity and Algorithmic Randomness
Stanford's CS231n CNN for Visual Recognition
Quantifying Complexity in Closed Systems
Gemini
Claude 3
Papers with Code LLM
The Unreasonable Effectiveness of RNNs
Understanding LSTM Networks
LLaMA-Factory
360-LLaMA-Factory
unsloth
TRL
Firefly
Xtuner
torchtune
Swift
AutoTrain
OpenRLHF
Ludwig
mistral-finetune
aikit
H2O-LLMStudio
LitGPT
LLMBox
PaddleNLP
workbench-llamafactory
TinyLLaVA Factory
LLM-Foundry
lmms-finetune
Simplifine
Transformer Lab
Liger-Kernel
ChatLearn
nanotron
Proxy Tuning
Effective LLM Alignment
Autotrain-advanced
Meta Lingua
Vision-LLM Alignment
finetune-Qwen2-VL
Online-RLHF
InternEvo
veRL
Oumi
Kiln
LM Studio
LLM Pricing
NVIDIA ChatRTX
ollama
Open WebUI
Text Generation WebUI
Xinference
LangChain
LlamaIndex
lobe-chat
TensorRT-LLM
vllm
LlamaChat
chat-with-mlx
Open Interpreter
Chat-ollama
chat-ui
MemGPT
koboldcpp
LLMFarm
enchanted
Flowise
Jan
LMDeploy
RouteLLM
MInference
Mem0
SGLang
AirLLM
LLMHub
YuanChat
LiteLLM
GuideLLM
LLM-Engines
OARC
g1
MemoryScope
OpenLLM
Infinity
optillm
LLaMA Box
ZhiLight
DashInfer
LocalAI
ktransformers
LangChain
GPT4All
Unstructured.io
LlamaIndex
dify
langfuse
Auto-GPT
PrivateGPT
LangChain Text Splitters
Unstructured.io
LlamaIndex
TextCortex
Label Studio
Texthero
Snorkel
Prodigy
DataTorch
Tabula
Adobe PDF Services API
Great Expectations
Kedro
Weights & Biases
Cleanlab
DeepSpeed
Doccano
Rubrix
Argilla
DataPrep.ai
Haystack
Datasets CLI
PDFPlumber
Nougat
Grobid
PdfMiner.six
OCRmyPDF
Camelot
DocTR
PaddleOCR
AnythingLLM
MaxKB
RAGFlow
Dify
FastGPT
Langchain-Chatchat
QAnything
Quivr
RAG-GPT
Verba
FlashRAG
GraphRAG
LightRAG (SylphAI-Inc)
GraphRAG-Ollama-UI
nano-GraphRAG
RAG Techniques
ragas
kotaemon
RAGapp
TurboRAG
LightRAG (HKUDS)
TEN
AutoRAG
KAG (OpenSPG - knowledge-enhanced)
Fast-GraphRAG
Tiny-GraphRAG
DB-GPT GraphRAG
Chonkie
RAGLite
KAG (OpenSPG - logical form)
CAG
MiniRAG
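XRAG
Complete Custom AI Solutions
One-stop search across systematically curated open-source AI courses and open-source AI tools, saving you the time otherwise lost to fragmented information.
A personalized study plan tailored to your learning progress, to maximize your learning efficiency.
Mentor Copilot answers your technical questions at any time, giving you a focused learning experience.
In-depth analysis of cutting-edge AI technology from the perspective of large-model theory innovators, giving you a professional reference for consulting.
From scenario customization, model customization, and data processing to model training and production inference service setup, we address your real-world needs.
Our team members come from Tsinghua University's large-model team and from senior AI engineers at leading internet companies, offering you professional AI consulting services.
The Tsinghua large-model team brings you the best open-source solutions and open-source courses.
Stay up to date with the latest news about our tools.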