12
16/05/2016 Quantitative Methods II (#SOC2031) 1 Quantitative Methods II Seminar #11: Secondary analysis. Big data and open data David Rozas ([email protected] || @drozas || 26AD03)

Quantitative Methods II (#SOC2031). Seminar #11: Secondary analysis. Big data and open data

Embed Size (px)

Citation preview

Page 1: Quantitative Methods II (#SOC2031). Seminar #11: Secondary analysis. Big data and open data

16/05/2016 Quantitative Methods II (#SOC2031) 1

Quantitative Methods II

Seminar #11: Secondary analysis. Big data and open data

David Rozas ([email protected] || @drozas || 26AD03)

Page 2: Quantitative Methods II (#SOC2031). Seminar #11: Secondary analysis. Big data and open data

16/05/2016 Quantitative Methods II (#SOC2031) 2

Outline

● Big data● A sociological perspective● Open data● Some examples● Q&A

Page 3: Quantitative Methods II (#SOC2031). Seminar #11: Secondary analysis. Big data and open data

16/05/2016 Quantitative Methods II (#SOC2031) 3

What is big data?

● Large and complex datasets. Difficult to process with traditional database systems

● Just the beginning!

Source: http://image.slidesharecdn.com/03-130821133227-phpapp02/95/cis13-big-data-analytics-vendor-perspective-insights-from-the-bleeding-edge-5-638.jpg?cb=1377092026

Page 4: Quantitative Methods II (#SOC2031). Seminar #11: Secondary analysis. Big data and open data

16/05/2016 Quantitative Methods II (#SOC2031) 4

What is big data? 3Vs

● Challenges (Douglas, 2001) come from expansion of:– Volume: amount of

data

– Variety: number of types of data

– Velocity: data processing, produced in real time

Source: http://itknowledgeexchange.techtarget.com/writing-for-business/files/2013/02/BigData.001.jpg

Page 5: Quantitative Methods II (#SOC2031). Seminar #11: Secondary analysis. Big data and open data

16/05/2016 Quantitative Methods II (#SOC2031) 5

Only technical?

● Used by researchers, companies, governments, ...

● Incredible possibilities (OKFN, 2016)– Understanding global challenges (e.g. climate

change)

– Democratic accountability and governance (e.g. scrutinise governments)

– Science: free access sum of human knowledge, improve our understanding of the World

Page 6: Quantitative Methods II (#SOC2031). Seminar #11: Secondary analysis. Big data and open data

16/05/2016 Quantitative Methods II (#SOC2031) 6

But...

● Privacy● Surveillance● Commodification of

human behaviour: data selling

Sociological perspectives are necessary

Source: http://bilerico.lgbtqnation.com/images/freefood.jpg

Page 7: Quantitative Methods II (#SOC2031). Seminar #11: Secondary analysis. Big data and open data

16/05/2016 Quantitative Methods II (#SOC2031) 7

Open data

● Some data should be freely available to everyone to use and republish as they wish

● In the context of Free Software, Open Hardware, Open Access, Free Culture, etc. Source: https://nomadicutopianism.files.wordpress.com/2011/04/picture-71.png

Page 8: Quantitative Methods II (#SOC2031). Seminar #11: Secondary analysis. Big data and open data

16/05/2016 Quantitative Methods II (#SOC2031) 8

Open definition

● Open definition (2016):– Availability and access: the data must be available as

a whole and at no more than a reasonable reproduction cost, preferably by downloading over the internet.

– Reuse and redistribution: the data must be provided under terms that permit reuse and redistribution including the intermixing with other datasets.

– Universal participation: everyone must be able to use, reuse and redistribute — there should be no discrimination against fields of endeavour or against persons or groups.

Source: https://en.wikipedia.org/wiki/Open_data#/media/File:Open_Data_stickers.jpg

Page 9: Quantitative Methods II (#SOC2031). Seminar #11: Secondary analysis. Big data and open data

16/05/2016 Quantitative Methods II (#SOC2031) 9

Some examples

● Mapping, Consumer Data Research Centre

● Data from data.gov.uk

● http://maps.cdrc.ac.uk/#/geodemographics/imdcrimee10to15/default/

Page 10: Quantitative Methods II (#SOC2031). Seminar #11: Secondary analysis. Big data and open data

16/05/2016 Quantitative Methods II (#SOC2031) 10

Some examples

● Sentiment analysis (computational linguistics to identify subjective information in source materials), NC State University

● Data from dev.twitter.com

● https://www.csc.ncsu.edu/faculty/healey/tweet_viz/tweet_app/

Page 11: Quantitative Methods II (#SOC2031). Seminar #11: Secondary analysis. Big data and open data

16/05/2016 Quantitative Methods II (#SOC2031) 11

References

● Douglas, L. (2001). 3d data management: Controlling data volume, velocity and variety. Gartner. Retrieved, 6.

● OKFN (2016). Open Knowledge. Retrieved 13th May 2016, from https://okfn.org/

● Open Definition (2016). The Open Definition. Retrieved 13th May 2016, from http://opendefinition.org/od/2.1/en/

Page 12: Quantitative Methods II (#SOC2031). Seminar #11: Secondary analysis. Big data and open data

16/05/2016 Quantitative Methods II (#SOC2031) 12

That's all! Questions?

Thanks!

Danke!Grazie!

¡Gracias!Obrigado!

This work is licensed under a Creative CommonsAttribution-ShareALike 4.0 Unported License

excerpt if otherwise noted.To view a copy of this license, please visit:

http://creativecommons.org/licenses/by-sa/4.0/

contact:

[email protected] || www.davidrozas.com

● @drozas