Commit bd2bb54

improve
1 parent 1d71392 commit bd2bb54

1 file changed, +2 -5 lines changed

README.md

Lines changed: 2 additions & 5 deletions
@@ -745,7 +745,6 @@ Inspired by [awesome-php](https://github.com/ziadoz/awesome-php).
 
 *Libraries for Machine Learning. See: [awesome-machine-learning](https://github.com/josephmisiti/awesome-machine-learning#python).*
 
-* [gensim](https://github.com/RaRe-Technologies/gensim) - Topic Modelling for Humans.
 * [Metrics](https://github.com/dmlc/xgboost) - Machine learning evaluation metrics.
 * [NuPIC](https://github.com/numenta/nupic) - Numenta Platform for Intelligent Computing.
 * [scikit-learn](http://scikit-learn.org/) - The most popular Python library for Machine Learning.
@@ -757,11 +756,9 @@ Inspired by [awesome-php](https://github.com/ziadoz/awesome-php).
 
 *Frameworks and libraries for MapReduce.*
 
-* [dpark](https://github.com/douban/dpark) - Python clone of Spark, a MapReduce alike framework in Python.
-* [dumbo](https://github.com/klbostee/dumbo) - Python module that allows one to easily write and run Hadoop programs.
+* [PySpark](https://pypi.python.org/pypi/pyspark/) - Apache Spark Python API.
 * [luigi](https://github.com/spotify/luigi) - A module that helps you build complex pipelines of batch jobs.
 * [mrjob](https://github.com/Yelp/mrjob) - Run MapReduce jobs on Hadoop or Amazon Web Services.
-* [PySpark](http://spark.apache.org/docs/latest/programming-guide.html) - The Spark Python API.
 * [streamparse](https://github.com/Parsely/streamparse) - Run Python code against real-time streams of data. Integrates with [Apache Storm](http://storm.apache.org/).
 
 ## Microsoft Windows
@@ -788,14 +785,14 @@ Inspired by [awesome-php](https://github.com/ziadoz/awesome-php).
 
 *Libraries for working with human languages.*
 
+* [gensim](https://github.com/RaRe-Technologies/gensim) - Topic Modelling for Humans.
 * [Jieba](https://github.com/fxsjy/jieba) - Chinese text segmentation.
 * [langid.py](https://github.com/saffsd/langid.py) - Stand-alone language identification system.
 * [NLTK](http://www.nltk.org/) - A leading platform for building Python programs to work with human language data.
 * [Pattern](http://www.clips.ua.ac.be/pattern) - A web mining module for the Python.
 * [SnowNLP](https://github.com/isnowfy/snownlp) - A library for processing Chinese text.
 * [spaCy](https://spacy.io/) - A library for industrial-strength natural language processing in Python and Cython.
 * [TextBlob](https://github.com/sloria/TextBlob) - Providing a consistent API for diving into common NLP tasks.
-* [TextGrocery](https://github.com/2shou/TextGrocery) - A simple, efficient short-text classification tool based on LibLinear and Jieba.
 
 ## Network Virtualization
 
