Google Cloud Storage emulator & testing library.

Go 1,378 268 Updated Apr 27, 2026

Python XML Schema Bindings

Python 130 74 Updated Apr 29, 2023

TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Apache Spark with minimal hand-tuning

Scala 2,273 399 Updated Sep 29, 2023

A List of Recommender Systems and Resources

4,823 705 Updated Dec 3, 2025

DEPRECATED. PLEASE USE https://github.com/confluentinc/kafka-connect-bigquery. A Kafka Connect BigQuery sink connector

Java 152 196 Updated Mar 4, 2024

A shell script to set up a macOS laptop for web and mobile development.

Shell 8,552 1,886 Updated Mar 27, 2026

Track changes to your Rails models

Ruby 6,998 910 Updated Oct 24, 2025

Do some browser detection with Ruby. Includes ActionController integration.

Ruby 2,493 364 Updated Jun 10, 2025

Python module installed with setup.py

Python 337 78 Updated Jun 29, 2022

This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/pandas-gbq

Python 490 130 Updated Mar 13, 2026

Samples for the DoubleClick for Advertisers Reporting and Trafficking API

C# 109 172 Updated Mar 16, 2026

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

Python 45,248 16,973 Updated Apr 30, 2026

Repository with examples and smoke tests for the GCP Airflow operators and hooks

Python 152 39 Updated Jan 15, 2017

Data pipeline is a tool to run Data loading pipelines. It is an open sourced app engine app that users can extend to suit their own needs. Out of the box it will load files from a source, transform…

Python 87 35 Updated Feb 11, 2014

Apache Beam is a unified programming model for Batch and Streaming data processing.

Java 8,558 4,544 Updated Apr 30, 2026

DonorsChoose.org Data Science Team Opensource Code

Jupyter Notebook 78 24 Updated Dec 8, 2022

Pentaho Data Integration (ETL), a.k.a. Kettle

Java 8,339 3,580 Updated Apr 30, 2026

Upserts, Deletes And Incremental Processing on Big Data.

Java 6,150 2,480 Updated Apr 30, 2026

Adds static typing to JavaScript to improve developer productivity and code quality.

Rust 22,206 1,885 Updated Apr 30, 2026

Streaming MapReduce with Scalding and Storm

Scala 2,127 259 Updated Jan 19, 2022

Ansible playbook to deploy distributed technologies

Python 67 43 Updated Nov 20, 2017

A short guide for transitioning from Python to Scala

64 28 Updated Jan 5, 2016

Repo to migrate the old wiki to, especially for developers and code examples

183 58 Updated Oct 18, 2016

Web UI for PrestoDB.

Java 2,750 444 Updated May 20, 2021

Apache Superset is a Data Visualization and Data Exploration Platform

TypeScript 72,660 17,166 Updated Apr 30, 2026

Docker image for Airbnb's Superset

Dockerfile 991 415 Updated Dec 22, 2025

Content for Udacity's Machine Learning curriculum

Jupyter Notebook 4,024 6,257 Updated Feb 24, 2022

An extension of GeoJSON that encodes topology! 🌐

JavaScript 4,879 683 Updated Sep 20, 2024