Code Examples

If you would like to see some sample programs showing how to use the Common Crawl corpus, you can see our Example Library here:

  https://github.com/commoncrawl/commoncrawl-examples.git

Or, start up an instance of our Amazon AMI and run the examples, too.

 

Below are links to some code written by other people for use with Common Crawl data:

Web Data Commons
Project description
Code on Assembla

Is Money the Root of All Evil?
Project description
Code on GitHub

Reverse Link
Web app
Code on GitHub

Online Sentiment Towards Congressional Bills
This project correlated Common Crawl data and congressional data to look at the online conversation surrounding individual pieces of legislation. Code on Github