Powerful menu bar manager for macOS

Swift 27,514 677 Updated Sep 20, 2025

A curated collection of fun and creative examples generated with Nano Banana & Nano Banana Pro🍌, Gemini-2.5-flash-image based model. We also release Nano-consistent-150K openly to support the commu…

22,342 2,305 Updated Dec 12, 2025

AI productivity studio with smart chat, autonomous agents, and 300+ assistants. Unified access to frontier LLMs

TypeScript 44,185 4,182 Updated Apr 23, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 362,927 74,173 Updated Apr 23, 2026

RealGen: Photorealistic Text-to-Image Generation via Detector-Guided Rewards.

Python 300 20 Updated Mar 13, 2026

HiMTok: Learning Hierarchical Mask Tokens for Image Segmentation with Large Multimodal Model

Python 94 4 Updated Jul 17, 2025

Edit-R1: Reinforce Image Editing with Diffusion Negative-Aware Finetuning and MLLM Implicit Feedback

Python 263 10 Updated Jan 24, 2026

[ICLR 2026 Oral] DiffusionNFT: Online Diffusion Reinforcement with Forward Process

Python 795 31 Updated Feb 10, 2026

A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/

JavaScript 4,809 1,058 Updated Sep 4, 2025

Code release for Ming-UniVision: Joint Image Understanding and Generation with a Continuous Unified Tokenizer

Python 142 5 Updated Oct 14, 2025

Some conferences' accepted paper lists (including AI, ML, Robotics)

Python 1,342 84 Updated Jan 23, 2025

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 3,106 272 Updated Apr 23, 2026

Extracted system prompts from ChatGPT (GPT-5.4, GPT-5.3, Codex), Claude (Opus 4.6, Sonnet 4.6, Claude Code), Gemini (3.1 Pro, 3 Flash, CLI), Grok (4.2, 4), Perplexity, and more. Updated regularly.

38,767 6,377 Updated Apr 22, 2026

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 91,298 14,052 Updated Apr 16, 2026

Awesome Unified Multimodal Models

1,215 39 Updated Mar 24, 2026

[🚀ICML 2025] "Taming Rectified Flow for Inversion and Editing" Using FLUX and HunyuanVideo for image and video editing!

Python 625 16 Updated May 1, 2025

Ming - facilitating advanced multimodal understanding and generation capabilities built upon the Ling LLM.

Jupyter Notebook 648 58 Updated Mar 17, 2026

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 19,040 1,738 Updated Jan 30, 2026

A curated list of publications on image and video segmentation leveraging Multimodal Large Language Models (MLLMs), highlighting state-of-the-art methods, innovative applications, and key advanceme…

205 5 Updated Apr 12, 2026