Introduction

There are many definitions of big data. Some define it by scale, some by methods or technology that can be used, and so on.

Directions

In your initial post, come up with your own or adopt (cite) definition for data mining of large data sets from the academic literature (max 5 years old).

Explain what requirements the data has to meet to qualify to be big data. What would make it different from a regular dataset in terms of scale, methods, and technologies that should be used to analyze it? Make sure NOT to limit the basis of your submission to so-called five V’s of big data. Explain what criteria the data must meet in order to be classified as big data. What distinguishes it from a regular dataset in terms of scale, methods, and technologies to be used to analyze it? Make certain that your submission does not rely solely on the so-called “five V’s of big data.”

In your responses find or come up with and an example of a dataset/data analysis that would meet the requirements of at least one student’s definition, but you would not qualify it as data mining of big data. Explain why.

1 primary post – 300 words

Published by
Essays
View all posts