For sample programs that show how to use the Common Crawl corpus, see our Example Library:
https://github.com/commoncrawl/commoncrawl-examples.git
Alternatively, start an instance of our Amazon AMI and run the examples there.