finds you similar websites
auto-suggest    top sites

Apr 29th, 2025

16 Popular Sites Like http://people.csail.mit.e...

The team has scanned through the internet and identified a lot of awesome dataset and corpus sites like this one. Stop on by and view more webpages that resemble this one.

Displaying 11 to 20 of 500 alternatives to http://people.csail.mit.edu/jrennie.... (Updated: Apr 29th, 2025)     [about these results]
Advanced Options
? Sort by:
popularity similarity
? Must Include:
? Cannot Include:
? Look For


Sponsored Links
 
You're looking for other sites like :
  Home Page for 20 Newsgroups Data Set
No information avaiable
http://people.csail.mit.edu/jrennie/20Newsgroups/
popularity:
dataset
corpus
data
nlp
datasets
research
machinelearning
newsgroup
clustering
text
new search by a custom tag signature
  LDC Catalog
No information avaiable
similarity:
popularity:
google
nlp
linguistics
n-gram
corpus
data
language
research
statistics
datasets
  LDC - Linguistic Data Consortium
Creates, collects and distributes speech and text databases, lexicons and other resources for research and development purposes.
similarity:
popularity:
linguistics
nlp
language
corpus
research
data
reference
database
tools
corpora
  (theinfo)
theinfo.org. for people with large data sets [log in] [edit] [history] ... theinfo.org is a community site; if you want to help run it, join the mailing list. ...
similarity:
popularity:
data
visualization
datasets
programming
dataset
analysis
web
statistics
datamining
database
  infochimps.org — Find Any Dataset in the World
List of Roots ... Sign up Home About Help Blog Gallery. Infochimps.org is still in beta testing. ... Find any dataset in the world at Infochimps.org ...
similarity:
popularity:
data
statistics
datasets
database
research
datamining
repository
reference
visualization
free
  MNIST handwritten digit database, Yann LeCun and Corinna Cortes
The MNIST database of handwritten digits, available from this page, has a ... The digits have been size-normalized and centered in a fixed-size image. ...
similarity:
popularity:
dataset
ai
ocr
machinelearning
database
mnist
image
data
datasets
programming
  Data Marketplace : Find, buy and sell data online
No information avaiable
similarity:
popularity:
data
marketplace
research
statistics
database
dataset
market
business
stats
information
  Statistical Science Web: Data Sets
Links to many data sets for teaching and research in statistics. ... Stories are classified according to statistical methods and major topics of interest. ...
similarity:
popularity:
statistics
datasets
data
reference
dataset
stats
resources
math
research
resource
  Datamob: Public data put to good use
Datamob highlights the connection between public data sources and the interfaces ... Datamob is licensed under Creative Commons Attribution-Share Alike 3.0 ...
similarity:
popularity:
data
visualization
datasets
statistics
database
api
datamining
public
technology
dataset
  Statistical NLP / corpus-based computational linguistics resources
Tools: Machine Translation, POS Taggers, NP chunking, Sequence models, Parsers, ... http://nlp.stanford.edu/links/statnlp.html ...
similarity:
popularity:
nlp
linguistics
tools
corpus
research
language
software
corpora
resources
ai
  ICPSR
National Archive of Computerized Data on Aging. National Archive of ... National Addiction & HIV Data Archive Program. Substance Abuse and Mental Health Data ...
similarity:
popularity:
data
research
statistics
sociology
icpsr
datasets
politics
education
reference
economics
< prev ... 1 2 3 4 5 ... 50 next >
Sorting Results
  • This slider determines how the matched sites are sorted.
  • If you want to see the most popular sites that are somewhat related to your search, slide this more towards "popularity."
  • If you want to see the sites that best matched your search, regardless of popularity, slide this towards "similarity."
Must Include Tags
  • Matched sites will not be shown unless they have all of the tags on this list.
  • This feature is useful for when you require a site to have been tagged as something.
  • To add a tag to this list, click "add tag" or click on any tag in a result.
Must Not Include Tags
  • Matched sites that have any tag on this list will not be shown.
  • This feature is useful for filtering out results that have tags you are absolutely not interested in.
  • To add a tag to this list, click "add tag" or click on any tag in a result.
Types of Results
  • This option lets you specify the types of sites to show.
  • If you want to only see domains (www..com), select "domains only."
  • If you want to only see articles (www..com/something/here), select "articles only."
  • If you don't care, or care so much about both, select "Both".
About The Results
an example search result
How moreofit Searches
Each website has a unique tag signature -- a set of words that users have described the website as. Moreofit searches for websites that have similar tag signatures and displays the results.
1: Similarity
A site's "similarity" is determined by how well its tag signature matches the tag signature that is being searched for. A 100% match means that it has the exact same tags in the exact same order, while a 0% match means it has no tags in common.
2: Popularity
The popularity of a website is, well, pretty much self explanatory.
3: Tag Signature
The tag signatures show how a site is described. The deeper the color of the tag, the more frequently the website is tagged as this. Tags underlined blue denote a tag that is in common with the search's tag signature.