Natural Language Corpus Data: Beautiful Data
Data files are derived from the Google Web Trillion Word Corpus, as described by ... You are free to use this code under the MIT license. To run this code, ...
http://norvig.com/ngrams/
popularity:
nlp
data
python
corpus
language
dataset
datamining
norvig
code
programming
|
Orange
A component based framework for data input/output, preprocessing, predictive ... That does not mean that updated bundles are not for serious things but scripting ...
similarity:
popularity:
python
datamining
software
programming
statistics
ai
opensource
data
visualization
tools
|
Email Datasets: person name disambiguation and threading
No information avaiable
similarity:
popularity:
email
research
enron
dataset
corpus
nlp
data
datamining
datasets
machinelearning
|
Statistical NLP / corpus-based computational linguistics resources
Tools: Machine Translation, POS Taggers, NP chunking, Sequence models, Parsers, ... http://nlp.stanford.edu/links/statnlp.html ...
similarity:
popularity:
nlp
linguistics
tools
corpus
research
language
software
corpora
resources
ai
|
The OpenNLP Homepage
Collaborative organization for open source projects related to natural language processing. Lists ongoing projects and documents proposed standard Java and XML APIs.
similarity:
popularity:
nlp
opensource
java
linguistics
software
programming
opennlp
machinelearning
language
tools
|
Enron Email Dataset
No information avaiable
similarity:
popularity:
email
enron
dataset
data
research
nlp
corpus
business
visualization
mail
|
The OpenNLP Homepage
No information avaiable
similarity:
popularity:
nlp
linguistics
java
tools
software
language
opensource
natural_language_processing
programming
textmining
|
Data Wrangling - machine learning, datamining, algorithms, python code, and more
... results below the chart are highlighted in yellow if they lead to datawrangling.com ... Search map of queries leading to clicks on datawrangling.com ...
similarity:
popularity:
datamining
machinelearning
blog
data
python
programming
statistics
research
analytics
hadoop
|
protobuf - Google Code
[ http://code.google.com/p/protobuf-netbeans-plugin/ NetBeans IDE plugin] * [http://code.google.com/p/protobuf-wireshark/ Wireshark/Ethereal packet ...
similarity:
popularity:
google
programming
opensource
python
data
java
c++
protocol
serialization
code
|
natural language processing blog
Just got back from Israel for ICML, which was a great experience: I'd ... The usual caveats apply (I didn't see everything it's a biased sample, blah blah ...
similarity:
popularity:
nlp
blog
linguistics
ai
language
blogs
research
machine_learning
machinelearning
programming
|
GATE Documentation
No information avaiable
similarity:
popularity:
nlp
datamining
language
programming
semantic
ontology
text
api
gate
textmining
|