Course Material
Pattern Slides
<ul><li>pattern.pdf</li></ul><div>Lorenzo (open data) presentation</div><div><ul><li>http://www.slideshare.net/lorenzobenussi/what-is-opendata</li></ul></div><div>
</div><div><div>Occupy Wall Street Historical Twitter Data</div><div>1) Annotated data</div><div>Each row has the fields: tweet, date, time, language, keyword, keyword score, polarity, subjectivity, profanity, retweets. The RT set contains only English tweets that have been retweeted at least once; at only 4MB, it is ideal for testing. The file “ows.py” is a Pattern script with a few examples of how to mine the data (for example, top negative tweets, top dirty tweets, tweets containing “arrest”, …). The keyword score can be used to gauge the “uniqueness” of a tweet. The polarity is based on positive/negative adjectives. However, since any Occupy Wall Street tweet could be seen as somewhat negative/revolutionary, you might experiment with counting only tweets with polarity > 0.5 as positive.</div><div><ul style="margin-top: 0px; margin-right: 0px; margin-bottom: 0.75em; margin-left: 20px; border-top-width: 0px; border-right-width: 0px; border-bottom-width: 0px; border-left-width: 0px; border-style: initial; border-color: initial; padding-top: 0px; padding-right: 0px; padding-bottom: 0px; padding-left: 0px; font-size: 1em; font-weight: normal; list-style-type: disc; list-style-position: outside; list-style-image: initial; background-repeat: no-repeat repeat; "><li style="margin-top: 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; border-top-width: 0px; border-right-width: 0px; border-bottom-width: 0px; border-left-width: 0px; border-style: initial; border-color: initial; padding-top: 0px; padding-right: 0px; padding-bottom: 0px; padding-left: 0px; font-size: 1em; font-weight: normal; ">OWS-sentiment.csv.zip (full, 300,000 tweets, 2011/11/12 - 2011/11/17)</li><li style="margin-top: 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; border-top-width: 0px; border-right-width: 0px; border-bottom-width: 0px; border-left-width: 0px; border-style: initial; border-color: initial; padding-top: 0px; padding-right: 0px; padding-bottom: 0px; padding-left: 0px; 
font-size: 1em; font-weight: normal; ">OWS-RT-sentiment.csv.zip</li><li style="margin-top: 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; border-top-width: 0px; border-right-width: 0px; border-bottom-width: 0px; border-left-width: 0px; border-style: initial; border-color: initial; padding-top: 0px; padding-right: 0px; padding-bottom: 0px; padding-left: 0px; font-size: 1em; font-weight: normal; ">ows.py.zip</li></ul><div>
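<div>The polarity experiment described above could be sketched roughly as follows. This is not the code from “ows.py”; it is a minimal sketch using only the Python standard library. The column order in COLUMNS is an assumption taken from the field list above, and the names read_tweets and positive_tweets are hypothetical helpers. The real CSV files may use a different delimiter or include a header row.</div>

```python
import csv

# Assumed column order, copied from the field list in the description above.
# The actual files may differ (delimiter, header row, extra columns).
COLUMNS = ["tweet", "date", "time", "language", "keyword",
           "keyword_score", "polarity", "subjectivity", "profanity", "retweets"]


def read_tweets(handle):
    """Yield one dict per CSV row, keyed by the assumed column names."""
    for row in csv.reader(handle):
        if len(row) != len(COLUMNS):
            continue  # skip rows that do not match the assumed layout
        record = dict(zip(COLUMNS, row))
        try:
            record["polarity"] = float(record["polarity"])
        except ValueError:
            continue  # skip a header row or a non-numeric polarity value
        yield record


def positive_tweets(records, threshold=0.5):
    """Keep only tweets whose polarity is strictly above the threshold."""
    return [r for r in records if r["polarity"] > threshold]
```

<div>To try it, open the unzipped CSV file in text mode, pass the file handle to read_tweets, and pass the result to positive_tweets. Lowering or raising the threshold argument is one way to explore how strict the definition of a “positive” tweet should be.</div>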
</div></div><div>2) Raw data</div><ul style="margin-top: 0px; margin-right: 0px; margin-bottom: 0.75em; margin-left: 20px; border-top-width: 0px; border-right-width: 0px; border-bottom-width: 0px; border-left-width: 0px; border-style: initial; border-color: initial; padding-top: 0px; padding-right: 0px; padding-bottom: 0px; padding-left: 0px; font-size: 1em; font-weight: normal; list-style-type: disc; list-style-position: outside; list-style-image: initial; background-repeat: no-repeat repeat; "><li style="margin-top: 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; border-top-width: 0px; border-right-width: 0px; border-bottom-width: 0px; border-left-width: 0px; border-style: initial; border-color: initial; padding-top: 0px; padding-right: 0px; padding-bottom: 0px; padding-left: 0px; font-size: 1em; font-weight: normal; ">ows.csv.zip -> latest version (works with Pattern)</li><li style="margin-top: 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; border-top-width: 0px; border-right-width: 0px; border-bottom-width: 0px; border-left-width: 0px; border-style: initial; border-color: initial; padding-top: 0px; padding-right: 0px; padding-bottom: 0px; padding-left: 0px; font-size: 1em; font-weight: normal; ">ows-days-retweet.csv -> total number of retweets per day</li><li style="margin-top: 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; border-top-width: 0px; border-right-width: 0px; border-bottom-width: 0px; border-left-width: 0px; border-style: initial; border-color: initial; padding-top: 0px; padding-right: 0px; padding-bottom: 0px; padding-left: 0px; font-size: 1em; font-weight: normal; ">12_17_no_RT.csv.zip</li><li style="margin-top: 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; border-top-width: 0px; border-right-width: 0px; border-bottom-width: 0px; border-left-width: 0px; border-style: initial; border-color: initial; padding-top: 0px; padding-right: 0px; padding-bottom: 0px; padding-left: 0px; font-size: 
1em; font-weight: normal; ">12_17_RT_count.csv.zip -> number of retweets for each tweet</li><li style="margin-top: 0px; margin-right: 0px; margin-bottom: 0px; margin-left: 0px; border-top-width: 0px; border-right-width: 0px; border-bottom-width: 0px; border-left-width: 0px; border-style: initial; border-color: initial; padding-top: 0px; padding-right: 0px; padding-bottom: 0px; padding-left: 0px; font-size: 1em; font-weight: normal; ">ows-sample.txt -> raw data sample from OccupyResearch Collective</li></ul></div><div>
</div><div>Custom Nodes</div><div><ul><li>Arc</li><li>phyllotaxis.ndbx</li><li>NodeBox Logo</li></ul><div>
</div></div><div>OpenPolis Data</div><div><ul><li>gruppo-1 (Region,age,gender,edcation,count).csv</li><li>gruppo-2e3 (age,gender,education_level,institution,profession,count).csv</li><li>total.csv.zip</li><li>gruppo-1.csv</li><li>gruppo-1-solo-aosta.csv</li></ul></div><div>
</div>