Upload
david-rozas
View
265
Download
1
Embed Size (px)
Citation preview
16/05/2016 Quantitative Methods II (#SOC2031) 1
Quantitative Methods II
Seminar #11: Secondary analysis. Big data and open data
David Rozas ([email protected] || @drozas || 26AD03)
16/05/2016 Quantitative Methods II (#SOC2031) 2
Outline
● Big data● A sociological perspective● Open data● Some examples● Q&A
16/05/2016 Quantitative Methods II (#SOC2031) 3
What is big data?
● Large and complex datasets. Difficult to process with traditional database systems
● Just the beginning!
Source: http://image.slidesharecdn.com/03-130821133227-phpapp02/95/cis13-big-data-analytics-vendor-perspective-insights-from-the-bleeding-edge-5-638.jpg?cb=1377092026
16/05/2016 Quantitative Methods II (#SOC2031) 4
What is big data? 3Vs
● Challenges (Douglas, 2001) come from expansion of:– Volume: amount of
data
– Variety: number of types of data
– Velocity: data processing, produced in real time
Source: http://itknowledgeexchange.techtarget.com/writing-for-business/files/2013/02/BigData.001.jpg
16/05/2016 Quantitative Methods II (#SOC2031) 5
Only technical?
● Used by researchers, companies, governments, ...
● Incredible possibilities (OKFN, 2016)– Understanding global challenges (e.g. climate
change)
– Democratic accountability and governance (e.g. scrutinise governments)
– Science: free access sum of human knowledge, improve our understanding of the World
16/05/2016 Quantitative Methods II (#SOC2031) 6
But...
● Privacy● Surveillance● Commodification of
human behaviour: data selling
Sociological perspectives are necessary
Source: http://bilerico.lgbtqnation.com/images/freefood.jpg
16/05/2016 Quantitative Methods II (#SOC2031) 7
Open data
● Some data should be freely available to everyone to use and republish as they wish
● In the context of Free Software, Open Hardware, Open Access, Free Culture, etc. Source: https://nomadicutopianism.files.wordpress.com/2011/04/picture-71.png
16/05/2016 Quantitative Methods II (#SOC2031) 8
Open definition
● Open definition (2016):– Availability and access: the data must be available as
a whole and at no more than a reasonable reproduction cost, preferably by downloading over the internet.
– Reuse and redistribution: the data must be provided under terms that permit reuse and redistribution including the intermixing with other datasets.
– Universal participation: everyone must be able to use, reuse and redistribute — there should be no discrimination against fields of endeavour or against persons or groups.
Source: https://en.wikipedia.org/wiki/Open_data#/media/File:Open_Data_stickers.jpg
16/05/2016 Quantitative Methods II (#SOC2031) 9
Some examples
● Mapping, Consumer Data Research Centre
● Data from data.gov.uk
● http://maps.cdrc.ac.uk/#/geodemographics/imdcrimee10to15/default/
16/05/2016 Quantitative Methods II (#SOC2031) 10
Some examples
● Sentiment analysis (computational linguistics to identify subjective information in source materials), NC State University
● Data from dev.twitter.com
● https://www.csc.ncsu.edu/faculty/healey/tweet_viz/tweet_app/
16/05/2016 Quantitative Methods II (#SOC2031) 11
References
● Douglas, L. (2001). 3d data management: Controlling data volume, velocity and variety. Gartner. Retrieved, 6.
● OKFN (2016). Open Knowledge. Retrieved 13th May 2016, from https://okfn.org/
● Open Definition (2016). The Open Definition. Retrieved 13th May 2016, from http://opendefinition.org/od/2.1/en/
16/05/2016 Quantitative Methods II (#SOC2031) 12
That's all! Questions?
Thanks!
Danke!Grazie!
¡Gracias!Obrigado!
This work is licensed under a Creative CommonsAttribution-ShareALike 4.0 Unported License
excerpt if otherwise noted.To view a copy of this license, please visit:
http://creativecommons.org/licenses/by-sa/4.0/
contact:
● [email protected] || www.davidrozas.com
● @drozas