word_cloud Public
Python word cloud library for use within Jupyter notebook and Python apps.
-
resources Public
Forked from opinosis-analytics/blog-articles
Curated List of Blog Posts From Opinosis Analytics
Updated Aug 14, 2021
-
OpinRank Public
OpinRank Dataset. Contains user reviews of cars and hotels: full reviews from Tripadvisor (~259,000 reviews) and Edmunds (~42,230 reviews).
-
nlp-in-practice Public
Starter code to solve real-world text data problems. Includes: Gensim Word2Vec, phrase embeddings, text classification with logistic regression, word count with PySpark, simple text preprocessing, …
-
data-science-blogs Public
Forked from rushter/data-science-blogs
A curated list of data science blogs
-
stop-words Public
Stop word lists
-
ROUGE-2.0 Public
ROUGE automatic summarization evaluation toolkit. Support for ROUGE-[N, L, S, SU], stemming and stopwords in different languages, unicode text evaluation, CSV output.
-
opinosis-summarization Public
This repo contains code and dataset for the Opinosis Summarization Framework
-
phrase-at-scale Public
Detect common phrases in large amounts of text using a data-driven approach. Discovered phrases can be of arbitrary size, and the approach can be used in languages other than English.
-
SIF_mini_demo Public
Forked from PrincetonML/SIF_mini_demo
Minimal example for sentence embedding by Smooth Inverse Frequency weighting scheme
Python MIT License Updated Mar 13, 2018
-
text-mining-and-nlp-apis Public
Forked from RxNLP/nlp-cloud-apis
APIs for clustering sentences, extracting topics, counting words & n-grams, extracting text from HTML or URL, computing similarity between texts and more.
-
pyrxnlp Public
Forked from RxNLP/PyRXNLP
Super simple NLP tools. Cluster sentences, get multiple text similarity measures including cosine, Jaccard and Dice, generate topics, extract text from HTML and more
-
clinical-concepts Public
Discovering Related Clinical Concepts using Large Amounts of Clinical Notes. An unsupervised graphical approach to mine related concepts by leveraging the volume within large amounts of clinical no…
-
python-examples Public
Working examples in Python
-
ROUGE-Utility Public
Utility tools to prepare and evaluate ROUGE scores. Perl script to convert the Perl output of ROUGE to CSV.
Updated Dec 15, 2017
-
Dataset for Micropinion Generation, based on user reviews from CNET. The reviews cover products from various categories such as TVs, cell phones, and GPS devices.
Updated Jul 14, 2017
-
bootstrap Public
Forked from twbs/bootstrap
The most popular HTML, CSS, and JavaScript framework for developing responsive, mobile-first projects on the web.
-
electron Public
Forked from electron/electron
Build cross-platform desktop apps with JavaScript, HTML, and CSS
C++ MIT License Updated Nov 4, 2016
-
GeoSpark Public
Forked from apache/sedona
A Cluster Computing System for Processing Large-Scale Spatial Data
Java MIT License Updated Nov 3, 2016
-
spark-lucenerdd Public
Forked from zouzias/spark-lucenerdd
Spark RDD with Lucene's query capabilities
Scala Apache License 2.0 Updated Nov 2, 2016
-
spectron Public
Forked from electron-userland/spectron
Test Electron apps using ChromeDriver
JavaScript MIT License Updated Oct 27, 2016
-
spark Public
Forked from apache/spark
Mirror of Apache Spark
Scala Apache License 2.0 Updated Oct 27, 2016
-
stanza Public
Forked from stanfordnlp/stanza-old
Stanford NLP group's shared Python tools.
Python Apache License 2.0 Updated Oct 26, 2016