Lists (1)
Sort Name ascending (A-Z)
Stars
A single CLAUDE.md file to improve Claude Code behavior, derived from Andrej Karpathy's observations on LLM coding pitfalls.
AI agents running research on single-GPU nanochat training automatically
Hundreds of models & providers. One command to find what runs on your hardware.
Natural Gradient Boosting for Probabilistic Prediction
Performance of various open source GBM implementations
A minimal benchmark for scalability, speed and accuracy of commonly used open source implementations (R packages, Python scikit-learn, H2O, xgboost, Spark MLlib etc.) of the top machine learning al…
A powerful Zotero AI and MCP plugin with ChatGPT, Gemini 3.1, Claude, DeepSeek V4, Grok, OpenRouter, Kimi 2.5, GLM 5, SiliconFlow, GPT-oss, Gemma 4, Qwen 3.5
Polars plugin for pairwise distance functions
Hierarchical Reasoning Model Official Release
Gin provides a lightweight configuration framework for Python
💫 Industrial-strength Natural Language Processing (NLP) in Python
A library of sklearn compatible categorical variable encoders
An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.
A non-validating SQL parser module for Python
Uses tokenized query returned by python-sqlparse and generates query metadata
ODBC (Open Database Connectivity) bindings for Rust.
A light-weight, flexible, and expressive statistical data testing library
python library for graphical and continuous representations of ICD9 and ICD10 codes
XGBoost for label-imbalanced data: XGBoost with weighted and focal loss functions
Fill Apache Arrow record batches from an ODBC data source in Rust.
Read Apache Arrow batches from ODBC data sources in Python
Turbodbc is a Python module to access relational databases via the Open Database Connectivity (ODBC) interface. The module complies with the Python Database API Specification 2.0.