kashif

Kashif Rasul kashif

Principal Research Scientist working on Deep Learning, Time Series Forecasting, Reinforcement Learning and HPC.

1.2k followers · 139 following

Berlin, Germany
08:45 (UTC +02:00)
@krasul

Achievements

x3 x4 x3

Achievements

x3 x4 x3

Highlights

Stars

huggingface / ml-intern

🤗 ml-intern: an open-source ML engineer that reads papers, trains models, and ships ML models

Python 6,415 580 Updated Apr 25, 2026

microsoft / ArchScale

Simple & Scalable Pretraining for Neural Architecture Research

Python 329 34 Updated Mar 31, 2026

splunk / cisco-time-series-model

Cisco Time Series Model is a continued pretrained time series forecasting model developed by Cisco.

Jupyter Notebook 26 3 Updated Apr 24, 2026

JeanKaddour / tpo

Target Policy Optimization (JAX)

Python 24 Updated Apr 18, 2026

thunlp / OPD

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Python 142 5 Updated Apr 18, 2026

Smlcrm / TempusBench

Unified benchmarking framework for time series forecasting, comparing traditional and foundation models with automated pipelines and isolated execution.

Python 11 1 Updated Apr 24, 2026

DBatUTuebingen / DiDi

Dissecting the Duck's Innards — A DuckDB-based course on the Design and Implementation of Database System Internals

C 328 8 Updated Apr 7, 2026

lewtun / parameter-golf

Forked from openai/parameter-golf

Train the smallest LM you can that fits in 16MB. Best model wins!

Python 17 1 Updated Mar 23, 2026

Joluck / MiSS

MiSS is a novel PEFT method that features a low-rank structure but introduces a new update mechanism distinct from LoRA, achieving an excellent balance between performance and efficiency.

Python 35 1 Updated Mar 9, 2026

Dimillian / Skills

My Codex Skills

Shell 3,387 180 Updated Mar 29, 2026

gstohl / godot-mujoco

C# 1 Updated Feb 7, 2026

facebookresearch / dllm_post_training

dTRPO: Trajectory Reduction in Policy Optimization of Diffusion Large Language Models

Python 8 1 Updated Apr 10, 2026

facebookresearch / wt-asbs

Code for “Enhancing Diffusion-Based Sampling with Molecular Collective Variables"

Python 18 3 Updated Dec 17, 2025

math-inc / Sphere-Packing-Lean

Forked from thefundamentaltheor3m/Sphere-Packing-Lean

A Lean formalisation of Maryna Viazovska's Fields Medal-winning solution to the sphere packing problem in dimension 8 and 24.

Lean 63 7 Updated Apr 7, 2026

lumalabs / tvm

Terminal Velocity Matching

Python 83 1 Updated Feb 14, 2026

Dao-AILab / grouped-latent-attention

Python 139 4 Updated May 29, 2025

bernatsalbanya / US-Electric-Distribution-Networks

Python 6 1 Updated Aug 6, 2025

eje24 / iap-diffusion-class

Course website for 6.S184/6.S975: Generative AI with Stochastic Differential Equations

HTML 32 9 Updated Mar 18, 2026

shinfxh / reverso

Python 50 2 Updated Apr 1, 2026

mit-han-lab / fouroversix

Code for the papers: “Four Over Six: More Accurate NVFP4 Quantization with Adaptive Block Scaling” and “Adaptive Block-Scaled Data Types”

Python 171 17 Updated Apr 21, 2026

dataflowr / gpu_llm_flash-attention

Course on Flash-attention in Triton

Jupyter Notebook 98 9 Updated Feb 9, 2026

windows7lover / DTE-DynamicTrainingEngine

Generic building-block toolbox for training neural networks with adaptive and recursive execution. It provides reusable components to control iteration, stopping, and unrolling during training, ena…

Python 27 Updated Feb 4, 2026

Lyy-iiis / pMF

Official Implementation of pMF https://arxiv.org/abs/2601.22158

Python 215 12 Updated Feb 19, 2026

adh1s / mfm

Official Implementation of "Meta Flow Maps enable scalable reward alignment"

Python 33 1 Updated Mar 14, 2026

lintool / guide

The Student's Guide to @lintool

323 22 Updated Feb 11, 2026

brain2vec / OmnEEG

Simple EEG tokenizer with PyTorch datasets.

Python 5 3 Updated Mar 4, 2026

microsoft / post-training-toolkit

Python 20 1 Updated Jan 28, 2026

Gen-Verse / dLLM-RL

[ICLR 2026] Official code for TraceRL: Revolutionizing post-training for Diffusion LLMs, powering the SOTA TraDo series.

Python 495 39 Updated Jan 28, 2026

haoyangzheng-ai / didi-instruct

[ICLR 2026] Discrete Diffusion Divergence Instruct (DiDi-Instruct)

Python 153 10 Updated Mar 4, 2026

HyperPotatoNeo / RSA

Python 148 21 Updated Sep 29, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Kashif Rasul kashif

Achievements

Achievements

Highlights

Block or report kashif

Stars

huggingface / ml-intern

microsoft / ArchScale

splunk / cisco-time-series-model

JeanKaddour / tpo

thunlp / OPD

Smlcrm / TempusBench

DBatUTuebingen / DiDi

lewtun / parameter-golf

Joluck / MiSS

Dimillian / Skills

gstohl / godot-mujoco

facebookresearch / dllm_post_training

facebookresearch / wt-asbs

math-inc / Sphere-Packing-Lean

lumalabs / tvm

Dao-AILab / grouped-latent-attention

bernatsalbanya / US-Electric-Distribution-Networks

eje24 / iap-diffusion-class

shinfxh / reverso

mit-han-lab / fouroversix

dataflowr / gpu_llm_flash-attention

windows7lover / DTE-DynamicTrainingEngine

Lyy-iiis / pMF

adh1s / mfm

lintool / guide

brain2vec / OmnEEG

microsoft / post-training-toolkit

Gen-Verse / dLLM-RL

haoyangzheng-ai / didi-instruct

HyperPotatoNeo / RSA