Structured vs. Unstructured Data: The Rise of Data Anarchy
2014-10-05 11:48
489 查看
Data science and business analytics works
with both structured and unstructured data. Yet the future belongs to unstructured or semi-structured data from both internal and external sources.
Total Enterprise Data Growth 2005-2015
IDC estimates the volume of digital data will
grow 40% to 50% per year. By 2020, IDC predicts the number will have reached 40,000 EB, or 40 Zettabytes (ZB). The world’s information is doubling every two years. By 2020 the world will generate 50 times the amount of information and 75 times the number
of information containers.
The massive growth of unstructured or semi-structured
data is amazing and has implications for data warehouse / business intelligence / data analytics architecture and database design. The way we capture, store, analyze, and distribute data is transforming. New technologies like deduplication, compression, and
analysis tools are lowering costs.
.
Structured data gives names to each field
in a database and defines the relationships between the fields. Unstructured data is usually not stored in a relational database (as traditionally defined) where the data model is relevant to the meaning of the data.
.
The Internet of Things (equipping all objects
in the world with identifying devices), blogs, videos, social media, emails, notes from call centers, and all forms of human and computer to computer communications will soon start to produce massive amounts of unstructured or semi-structured data.
.
The trick is to create value by extracting
the right information from both internal and external data sources. That is what the science of data and art of business analytics needs to learn to extract from larger and larger sets of unstructured data.
http://www.datasciencecentral.com/profiles/blogs/structured-vs-unstructured-data-the-rise-of-data-anarchy
with both structured and unstructured data. Yet the future belongs to unstructured or semi-structured data from both internal and external sources.
Total Enterprise Data Growth 2005-2015
IDC estimates the volume of digital data will
grow 40% to 50% per year. By 2020, IDC predicts the number will have reached 40,000 EB, or 40 Zettabytes (ZB). The world’s information is doubling every two years. By 2020 the world will generate 50 times the amount of information and 75 times the number
of information containers.
The massive growth of unstructured or semi-structured
data is amazing and has implications for data warehouse / business intelligence / data analytics architecture and database design. The way we capture, store, analyze, and distribute data is transforming. New technologies like deduplication, compression, and
analysis tools are lowering costs.
.
Structured data gives names to each field
in a database and defines the relationships between the fields. Unstructured data is usually not stored in a relational database (as traditionally defined) where the data model is relevant to the meaning of the data.
.
The Internet of Things (equipping all objects
in the world with identifying devices), blogs, videos, social media, emails, notes from call centers, and all forms of human and computer to computer communications will soon start to produce massive amounts of unstructured or semi-structured data.
.
The trick is to create value by extracting
the right information from both internal and external data sources. That is what the science of data and art of business analytics needs to learn to extract from larger and larger sets of unstructured data.
http://www.datasciencecentral.com/profiles/blogs/structured-vs-unstructured-data-the-rise-of-data-anarchy
相关文章推荐
- 评论数据库Win A Free Copy of Packt’s Managing Multimedia and Unstructured Data in the Oracle Database e-book
- Win A Free Copy of Packt’s Managing Multimedia and Unstructured Data in the Oracle Database e-book
- MIT Data Science Machine Becomes As Intuitive As Humans: Rise Of The Machines?
- What is the key of Data Assimilation?
- BUG: The GetSchemaTable method of the SqlDataReader object returns the wrong column name
- (转载)A Crash Course on the Depths of Win32 Structured Exception Handling
- Sql Script To set the show sort of data ( up or down )
- Release of the Data Access Application Block 3.1
- The size of base data type in Ubuntu-AMD64
- A simple Example of data processing from Excel as the datasource
- The Rise of the Private-Sector Military
- the differences of DataRelation class between 1.1 and 2.0
- A VS.NET add-in to know the content of the any dataset during debugging
- Exploring the Power of Links in Data Mining-韩家炜演讲摘录
- The usage of intellisense in Vs .net 2005
- the differences of DataRelation class between 1.1 and 2.0
- Data on the Outside vs. Data on the Inside
- customize the template of new files in vs.net 2003
- Data format for the interchange of fingerprint
- Fantastic Four-Rise Of The Silver Surfer 的相机