31
Order from chaos Simon Brown DBI317

Simon Brown. a generic label for describing any corporate information that is not in a database

Embed Size (px)

Citation preview

Order from chaos

Simon Brown

DBI317

What Is Unstructured Data?

What is unstructured data?a generic label for describing any corporate information

that is not in a database

unstructured data : Data that does not reside in fixed locations

any data that has no identifiable structure

information that either does not have a pre-defined data model or is not organized in a pre-defined manner

Structure

Why is unstructured data important?Unstructured data doubles every three months

7 million web pages are added every day

80% of business is conducted on unstructured information

85% of all data stored is held in an unstructured format

What are we working with?

Technology

UnstructuredExcelFile shares/FoldersInternet Data

StructuredSQL Server 2012Analysis ServicesIntegration Services

Semi-structuredSharepointExcel ServicesService Manager

Business IntelligenceSharepoint PowerPivot Power View

Power Query Power Map BI Semantic Model

Meet BobBusiness RequirementsFast report creationTurning personal information into BIWill use outcome to plan special pricing and inventory managementTo be discarded after use

Demo

Retail Measurements

ReviewWhat We DidTook unstructured data joined it to structured data

Bob has used this for personal reporting

Identified this as tactical data

How We Did It

UnstructuredExcelFile shares/FoldersInternet Data

StructuredAnalysis Services

Semi-structured

Business IntelligenceBI Semantic Model

Success

Meet SallyBusiness RequirementsGain insight for planning meetingNeed to understand what current state looks likeNeed to plan for future stateWork out who needs to be engaged in private sector to align planningNeed to know if more funding will be required

Demo

Hospital Growth

ReviewWhat We DidTook structured data combined it

Created a personal data source

Published reports from the data source

Used reports for the basis of a planning presentation

How We Did It

UnstructuredExcelInternet Data

StructuredSQL Server 2012Analysis Services

Semi-structuredSharepointExcel Services

Business IntelligenceSharepoint PowerPivot Power View

Power Query Power Map BI Semantic Model

Success

Meet SimonBusiness RequirementsKeep data for other usesFeel good about house purchaseUnderstand which agent is likely to give the best result if needing to sell

Demo

House Pricing

ReviewWhat We DidTook unstructured data from the internet and transformed to data set

No longer a personal datasource but a consumable datasource

Used ETL to transform to structured data

Use PowerView to visualise data

How We Did It

UnstructuredExcelInternet Data

StructuredSQL Server 2012Integration Services

Semi-structuredSharepointExcel Services

Business IntelligenceSharepoint PowerPivot Power View

Power Query BI Semantic Model

Total Chaos

ChaosWe have over 10,000 Excel spreadsheets in the organisation. I am going to ban Excel.

- Manager

Discovery

Data Source

Change Control

Success

Questions

Developer Network

Resources for Developers

http://msdn.microsoft.com/en-au/

Learning

Virtual Academy

http://www.microsoftvirtualacademy.com/

TechNet

Resources

Sessions on Demand

http://channel9.msdn.com/Events/TechEd/Australia/2013

Resources for IT Professionals

http://technet.microsoft.com/en-au/

Track Resources • Download the CTP for SQL Server 2014 and accelerate your queries

using In-Memory OLTP - http://technet.microsoft.com/en-us/evalcenter/dn205290.aspx

• Get into the cloud with an Azure account - use SQL database in Windows Azure or take your workload into Azure VM - www.windowsazure.com

• Get big with big data – HDInsight on Azure and grab the latest Power BI featureshttp://www.windowsazure.com/en-us/documentation/services/hdinsight/?fb=en-us

Power BI - www.powerbi.com

© 2013 Microsoft Corporation. All rights reserved. Microsoft, Windows and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries.The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.