Highlights
- Pro
Stars
An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)
🎤 The easiest way to transcribe audio in Swift
Train transformer language models with reinforcement learning.
Robust recipes to align language models with human and AI preferences
Chiowe / ChinaTextbook-
Forked from TapXWorld/ChinaTextbook所有小初高、大学PDF教材。
PyLipidParse is a lightweight Python library for converting standard lipid notation into RDKit and SMILES representations.
《大模型白盒子构建指南》:一个全手搓的Tiny-Universe
《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程
The 500 AI Agents Projects is a curated collection of AI agent use cases across various industries. It showcases practical applications and provides links to open-source projects for implementation…
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
A plugin template for Zotero.
EDUKG: a Heterogeneous Sustainable K-12 Educational Knowledge Graph
一个精心整理的公开教育数据集列表,专为教育数据挖掘、学习分析及相关领域的研究人员设计。
Retrieval and Retrieval-augmented LLMs
🚀 「大模型」2小时从0训练65M参数的视觉多模态VLM!🌏 Train a 65M-parameter VLM from scratch in just 2 hours!
Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.
LostRuins / koboldcpp
Forked from ggml-org/llama.cppRun GGUF models easily with a KoboldAI UI. One File. Zero Install.
Data-driven elucidation of flavor chemistry
Open-sourced dialogue foundation model for Chemistry and molecule science
A convenient wrapper around PubChem PUG REST API that allows to search for many compound properties available at PubChem with ease
Python library for processing (tandem) mass spectrometry data and for computing spectral similarities.