finds you similar websites

Apr 29th, 2025

16 Popular Sites Like http://people.csail.mit.e...

The team has scanned the internet and identified many dataset and corpus sites like this one. Browse the results below to find more webpages that resemble it.

Displaying 41 to 50 of 500 alternatives to http://people.csail.mit.edu/jrennie.... (Updated: Apr 29th, 2025)
 
You're looking for other sites like:
  Home Page for 20 Newsgroups Data Set
No information available
http://people.csail.mit.edu/jrennie/20Newsgroups/
popularity:
dataset
corpus
data
nlp
datasets
research
machinelearning
newsgroup
clustering
text
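  I read "Introduction to Machine Learning for Language Processing" (言語処理のための機械学習入門) - 射撃しつつ前転
< Deploying a Sinatra app with Unicorn... | Three extremely handy zshrc settings...> 2010-07-11. 言語処理のための ... I knew from Twitter that a book called "Introduction to Machine Learning for Language Processing" was going to be published, but I had assumed it would come out around August, and it turns out it is apparently already on sale. ...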
similarity:
popularity:
nlp
machinelearning
book
自然言語処理
machine_learning
learning
linguistics
science
research
まとめ
  Tim Davis: UF Sparse Matrix Collection : sparse matrices from a wide range of applications
The University of Florida Sparse Matrix Collection is a large, widely available, and ... The URL: http://www.cise.ufl.edu/research/sparse/matrices, submitted to ACM Trans. on ...
similarity:
popularity:
matrix
math
visualization
sparse
graph
research
data
dataset
matlab
numerical
  The OpenNLP Homepage
Collaborative organization for open source projects related to natural language processing. Lists ongoing projects and documents proposed standard Java and XML APIs.
similarity:
popularity:
nlp
opensource
java
linguistics
software
programming
opennlp
machinelearning
language
tools
  Word frequency lists and frequency dictionary of English
No information available
similarity:
popularity:
corpus
english
dictionary
linguistics
language
reference
corpora
research
data
vocabulary
  SNAP: Stanford Network Analysis Platform
SNAP is also available through the NodeXL which is a graphical front-end that ... A collection of more than 40 large network datasets from tens of thousands of ...
similarity:
popularity:
networks
network
datasets
software
graph
library
data
graphs
analysis
research
  Hal Daumé III - about me
No information available
similarity:
popularity:
people
nlp
machinelearning
haskell
linguistics
researcher
programming
machine-learning
homepage
research
  The Dataverse Network Project | The Dataverse Network Project
VDC provides a complete open-source, digital library system for the management, dissemination, exchange, and ... Login/Register: thedata.org. IQSS Dataverse ...
similarity:
popularity:
data
research
statistics
opensource
software
repository
tools
library
storage
web
  [bnc] British National Corpus
A balanced synchronic text corpus containing 100 million words with morphosyntactic annotation.
similarity:
popularity:
corpus
english
language
dictionary
linguistics
reference
vocabulary
research
writing
british
  Socrata | Making Data Social
No information available
similarity:
popularity:
data
database
statistics
government
social
research
opendata
socialnetworking
web2.0
tools
  OSU Linguistics: Corpus Resources
funding for this project provided by OSU College of Humanities Seed Grant ... New book on Developing Linguistic Corpora:a Guide to Good Practice. TOKENIZATION ...
similarity:
popularity:
corpus
nlp
linguistics
tools
software
list
ai
corpora
parsing
tagging
Sorting Results
  • This slider determines how the matched sites are sorted.
  • If you want to see the most popular sites that are somewhat related to your search, slide this more towards "popularity."
  • If you want to see the sites that best matched your search, regardless of popularity, slide this towards "similarity."
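One way to picture the slider is as a weight that blends the two scores. The sketch below is only an illustration under stated assumptions: the page does not say how moreofit combines the scores, so the linear blend, the 0-to-1 score ranges, and the names blended_score, sort_results, site_a, and site_b are all hypothetical.

```python
from typing import List, Tuple

# (name, similarity, popularity), both scores assumed to lie in [0, 1].
Result = Tuple[str, float, float]


def blended_score(similarity: float, popularity: float, slider: float) -> float:
    """Blend the two scores. slider=0.0 means pure similarity,
    slider=1.0 means pure popularity (an assumed convention)."""
    if not 0.0 <= slider <= 1.0:
        raise ValueError("slider must be between 0.0 and 1.0")
    return (1.0 - slider) * similarity + slider * popularity


def sort_results(results: List[Result], slider: float) -> List[Result]:
    """Return results ordered from highest to lowest blended score."""
    return sorted(
        results,
        key=lambda r: blended_score(r[1], r[2], slider),
        reverse=True,
    )


# Hypothetical data: site_a matches well but is obscure,
# site_b matches loosely but is popular.
results = [("site_a", 0.9, 0.2), ("site_b", 0.4, 0.95)]

by_similarity = sort_results(results, slider=0.0)
by_popularity = sort_results(results, slider=1.0)
```

With the slider at the "similarity" end, site_a is listed first; at the "popularity" end, site_b is.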
Must Include Tags
  • Matched sites will not be shown unless they have all of the tags on this list.
  • This feature is useful when you need every result to carry a specific tag.
  • To add a tag to this list, click "add tag" or click on any tag in a result.
Must Not Include Tags
  • Matched sites that have any tag on this list will not be shown.
  • This feature is useful for filtering out results that have tags you are absolutely not interested in.
  • To add a tag to this list, click "add tag" or click on any tag in a result.
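The two tag lists behave like set filters: a site needs all of the "must include" tags and none of the "must not include" tags. A minimal sketch of that rule follows. The function name passes_tag_filters is hypothetical, and the tag lists are shortened versions of the ones shown in the results above; this is not moreofit's actual code.

```python
from typing import Iterable


def passes_tag_filters(
    site_tags: Iterable[str],
    must_include: Iterable[str] = (),
    must_not_include: Iterable[str] = (),
) -> bool:
    """True if the site has every required tag and no excluded tag."""
    tags = set(site_tags)
    required = set(must_include)
    excluded = set(must_not_include)
    return required <= tags and tags.isdisjoint(excluded)


# Shortened tag lists taken from three of the results on this page.
results = {
    "The OpenNLP Homepage": ["nlp", "opensource", "java", "linguistics"],
    "[bnc] British National Corpus": ["corpus", "english", "language", "linguistics"],
    "Socrata | Making Data Social": ["data", "database", "statistics", "government"],
}

# Require "linguistics" and exclude anything tagged "java".
shown = [
    name
    for name, tags in results.items()
    if passes_tag_filters(tags, must_include=["linguistics"], must_not_include=["java"])
]
```

Here only "[bnc] British National Corpus" remains: Socrata lacks the required tag, and OpenNLP carries the excluded one.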
Types of Results
  • This option lets you specify the types of sites to show.
  • If you want to only see domains (www..com), select "domains only."
  • If you want to only see articles (www..com/something/here), select "articles only."
  • If you have no preference, or want to see both, select "Both".
About The Results
How moreofit Searches
Each website has a unique tag signature -- a set of words that users have used to describe the website. Moreofit searches for websites that have similar tag signatures and displays the results.
1: Similarity
A site's "similarity" is determined by how well its tag signature matches the tag signature that is being searched for. A 100% match means that it has the exact same tags in the exact same order, while a 0% match means it has no tags in common.
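The exact formula is not given here, so the following is only a sketch of one order-aware measure that satisfies both endpoints: identical tags in identical order score 1.0, and no shared tags score 0.0. The rank weighting and the names rank_weights and tag_similarity are assumptions made for illustration, and each signature is assumed to list a tag at most once.

```python
from typing import Dict, Sequence


def rank_weights(tags: Sequence[str]) -> Dict[str, int]:
    """Give earlier tags larger weights: the first of n tags gets n, the last gets 1."""
    n = len(tags)
    return {tag: n - i for i, tag in enumerate(tags)}


def tag_similarity(a: Sequence[str], b: Sequence[str]) -> float:
    """Order-aware overlap between two tag signatures, in [0.0, 1.0]."""
    wa, wb = rank_weights(a), rank_weights(b)
    if not wa or not wb:
        return 0.0
    # A shared tag only counts for the smaller of its two rank weights,
    # so a tag that sits at different positions is penalized.
    shared = sum(min(wa[t], wb[t]) for t in wa.keys() & wb.keys())
    return shared / max(sum(wa.values()), sum(wb.values()))


query = ["dataset", "corpus", "data"]

same = tag_similarity(query, query)                              # same tags, same order
reordered = tag_similarity(query, ["corpus", "dataset", "data"])  # same tags, new order
unrelated = tag_similarity(query, ["matrix", "math"])            # nothing in common
```

Under this weighting, the reordered signature scores strictly between the two extremes, which matches the idea that tag order matters as well as tag overlap.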
2: Popularity
The popularity of a website is, well, pretty much self-explanatory.
3: Tag Signature
The tag signatures show how a site is described. The deeper the color of a tag, the more frequently the website is tagged with it. Tags underlined in blue are shared with the search's tag signature.