Starred repositories
Code implementation of the paper "World-in-World: World Models in a Closed-Loop World" (ICLR'26 Oral)
A small syntactic parser for the Turkish language, built with the CKY algorithm.
Web UI for training and running open models like Gemma 4, Qwen3.5, DeepSeek, gpt-oss locally.
TRScraper is an application developed for use in natural language processing work; it enables text mining on large platforms that host Turkish-language content.
Reverse Instructions to generate instruction tuning data with corpus examples
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
Code and documentation to train Stanford's Alpaca models, and generate the data.
Fast and memory-efficient exact attention
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
Lightweight, dependency-free Python library and CLI for downloading YouTube videos, playlists, and captions.
Webster's English Dictionary in JSON format, and related Swift parsing utility
Restoring and attributing ancient texts using deep neural networks
Robust Speech Recognition via Large-Scale Weak Supervision
Generating paper titles (and more!) with GPT trained on data scraped from arXiv.
The corresponding code from our paper "DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations". Do not hesitate to open an issue if you run into any trouble!
State-of-the-art NLP tools for Turkish
A Collection of BM25 Algorithms in Python
BERT-base NLP pipeline for Turkish: NER, sentiment analysis, question answering, etc.
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
Code for Language-Interfaced FineTuning for Non-Language Machine Learning Tasks.
PRAW, an acronym for "Python Reddit API Wrapper", is a Python package that allows for simple access to Reddit's API.
A multithreaded Pushshift.io API wrapper for reddit.com comment and submission searches.
An autoregressive character-level language model for making more things
This is a TensorFlow-based rotation detection benchmark, also called AlphaRotate.
Repository for the paper FFAVOD: Feature Fusion Architecture for Video Object Detection
A framework for few-shot evaluation of language models.