BigData Analytics: Solution Or Resolution?
3/5
()
About this ebook
In 1997, NASA researchers Michael Cox and David Ellsworth publish “Application-controlled demand paging for out-of-core visualization” in the Proceedings of the IEEE 8th conference on Visualization. They start the article with “Visualization provides an interesting challenge for computer systems: data sets are generally quite large, taxing the capacities of main memory, local disk, and even remote disk. We call this the problem of big data. When data sets do not fit in main memory (in core), or when they do not fit even on local disk, the most common solution is to acquire more resources.” It is the first article in the ACM digital library to use the term “big data.” Michael Lesk publishes “How much information is there in the world?”? Lesk concludes that “There may be a few thousand petabytes of information all told; and the production of tape and disk will reach that level by the year 2000. So in only a few years, (a) we will be able [to] save everything–no information will have to be thrown out, and (b) the typical piece of information will never be looked at by a human being.”
Binayaka Mishra
Binayaka Mishra is an experienced IT professional, in various tools and technologies like Data Warehousing, BigData Analytics, Cloud Computing, Reporting Analytics & Project Management documentation with 14+ years’of experience. He was Graduated in Computer Science & Engineering from National Institute Of Science & Technology, Berhampur, Odisha, India in 2002.He has worked in several critical roles with MNC’s like Tech Mahindra, Oracle Corporation, Wipro Technology,CapGemini UK,CapGemini India Pvt Ltd, UBS , AoN Hewitt Associates India Pvt Ltd, HMRC -UK and TUI Travel Plc -UK. Apart from technical details, his mastery are into functional domains like Payroll Processing, Tax Calculation, UK NI, BFSI,Telecommunication, Corporate Tax measurements divisions, Investment Banking, Automotive, Asset management , Security and Travel & Tourisim.Currently working as a Solution Architect / Project Manager in Tech Mahindra, India, loves to listen to music, play snooker, Bowling and a desperate swimmer like a shark.More Information could be found about him in his Linkedin Profile : https://www.linkedin.com/in/binayaka-mishra-b09612142/For any comments or advise, please feel free to write to: mishra.binayaka.18005@gmail.com
Read more from Binayaka Mishra
Master Data Management Rating: 0 out of 5 stars0 ratingsDigital Technology: The World Of Our Own Rating: 0 out of 5 stars0 ratingsOpen Source ETL-The Prodigy Kids Rating: 1 out of 5 stars1/5The World Of Agile:Incarnation Of DevOps Rating: 0 out of 5 stars0 ratingsCloud Computing: Reign Of Access Rating: 0 out of 5 stars0 ratingsOpen Source Database: Virtue Or Vice? Rating: 0 out of 5 stars0 ratingsOpen Source Software: The Beginning Of A New Era Rating: 0 out of 5 stars0 ratings
Related to BigData Analytics
Related ebooks
Real-time business intelligence A Complete Guide Rating: 0 out of 5 stars0 ratingsBanking on Cloud Data Platforms: A Guide Rating: 0 out of 5 stars0 ratingsMaking Big Data Work for Your Business: A guide to effective Big Data analytics Rating: 0 out of 5 stars0 ratingsSql Mastery: The Masterclass Guide to Become an SQL Expert Rating: 0 out of 5 stars0 ratingsHadoop Real-World Solutions Cookbook - Second Edition Rating: 0 out of 5 stars0 ratingsHadoop Blueprints Rating: 0 out of 5 stars0 ratingsManaging Data in Motion: Data Integration Best Practice Techniques and Technologies Rating: 0 out of 5 stars0 ratingsDataOps A Complete Guide - 2020 Edition Rating: 0 out of 5 stars0 ratingsReal-Time Big Data Analytics Rating: 5 out of 5 stars5/5Unstructured Data Analytics: How to Improve Customer Acquisition, Customer Retention, and Fraud Detection and Prevention Rating: 0 out of 5 stars0 ratingsData Warehousing: Using the Wal-Mart Model Rating: 4 out of 5 stars4/5Perspectives on Data Science for Software Engineering Rating: 5 out of 5 stars5/5HDInsight Essentials - Second Edition Rating: 0 out of 5 stars0 ratingsEnterprise Data Warehouse Third Edition Rating: 0 out of 5 stars0 ratingsBig Data Analytics for Creative Marketers: Money Spinner Rating: 3 out of 5 stars3/5Azure SQL Data Warehouse A Complete Guide - 2020 Edition Rating: 0 out of 5 stars0 ratingsLearn Data Warehousing in 24 Hours Rating: 0 out of 5 stars0 ratingsData Pipelines A Complete Guide - 2021 Edition Rating: 0 out of 5 stars0 ratingsBuilding Big Data Applications Rating: 0 out of 5 stars0 ratingsData Lake Development with Big Data Rating: 0 out of 5 stars0 ratingsCloud Computing… Commoditizing It: The Imperative Venture for Every Enterprise Rating: 0 out of 5 stars0 ratingsData Lakes For Dummies Rating: 0 out of 5 stars0 ratingsTaming The Big Data Tidal Wave: Finding Opportunities in Huge Data Streams with Advanced Analytics Rating: 4 out of 5 stars4/5Deep Learning for Computer Vision with SAS: An Introduction Rating: 0 out of 5 stars0 ratingsSocial Data Analytics: Collaboration for the Enterprise Rating: 1 out of 5 stars1/5Learn Hbase in 24 Hours Rating: 0 out of 5 stars0 ratingsImplementing Database Security and Auditing Rating: 4 out of 5 stars4/5
Software Development & Engineering For You
Grokking Algorithms: An illustrated guide for programmers and other curious people Rating: 4 out of 5 stars4/5PYTHON: Practical Python Programming For Beginners & Experts With Hands-on Project Rating: 5 out of 5 stars5/5Learn to Code. Get a Job. The Ultimate Guide to Learning and Getting Hired as a Developer. Rating: 5 out of 5 stars5/5Photoshop For Beginners: Learn Adobe Photoshop cs5 Basics With Tutorials Rating: 0 out of 5 stars0 ratingsPython For Dummies Rating: 4 out of 5 stars4/5SQL For Dummies Rating: 0 out of 5 stars0 ratingsLevel Up! The Guide to Great Video Game Design Rating: 4 out of 5 stars4/5The Inmates Are Running the Asylum (Review and Analysis of Cooper's Book) Rating: 4 out of 5 stars4/5OneNote: The Ultimate Guide on How to Use Microsoft OneNote for Getting Things Done Rating: 1 out of 5 stars1/5Hand Lettering on the iPad with Procreate: Ideas and Lessons for Modern and Vintage Lettering Rating: 4 out of 5 stars4/5Modern C++ for Absolute Beginners: A Friendly Introduction to C++ Programming Language and C++11 to C++20 Standards Rating: 0 out of 5 stars0 ratingsBeginning Programming For Dummies Rating: 4 out of 5 stars4/5Managing Humans: Biting and Humorous Tales of a Software Engineering Manager Rating: 4 out of 5 stars4/5Git Essentials Rating: 4 out of 5 stars4/5How to Write Effective Emails at Work Rating: 4 out of 5 stars4/5Android App Development For Dummies Rating: 0 out of 5 stars0 ratingsLua Game Development Cookbook Rating: 0 out of 5 stars0 ratingsiPhone Application Development For Dummies Rating: 4 out of 5 stars4/5The Essential Persona Lifecycle: Your Guide to Building and Using Personas Rating: 4 out of 5 stars4/5Tiny Python Projects: Learn coding and testing with puzzles and games Rating: 5 out of 5 stars5/5How Do I Do That In InDesign? Rating: 5 out of 5 stars5/5Adobe Illustrator CC For Dummies Rating: 5 out of 5 stars5/5Flow: A Handbook for Change-Makers, Mavericks, Innovators and Leaders Rating: 0 out of 5 stars0 ratingsHow Do I Do That in Photoshop?: The Quickest Ways to Do the Things You Want to Do, Right Now! Rating: 4 out of 5 stars4/5Data Visualization: a successful design process Rating: 4 out of 5 stars4/5Reversing: Secrets of Reverse Engineering Rating: 4 out of 5 stars4/5Agile Practice Guide Rating: 4 out of 5 stars4/5
Reviews for BigData Analytics
1 rating0 reviews
Book preview
BigData Analytics - Binayaka Mishra
Big Data Analytics-Solution Or Resolution?
By Binayaka Mishra
Copyright 2017, Binayaka Mishra
Smashwords Edition
This ebook is licensed for your personal enjoyment only. This ebook may not be re-sold
or given away to other people. If you would like to share this book with another person,
please purchase an additional copy for each recipient. If you’re reading this book and did
not purchase it, or it was not purchased for your use only, then please return to your
favorite ebook retailer and purchase your own copy. Thank you for respecting the hard
work of this author.
Table of CONTENTS
Acknowledgements
Dedication
Chapter 1: Known Facts
Chapter 2: BigData Implementation
2.1 Stage 1
2.2 Stage 2
2.3 Stage 3
Chapter 3: BigData Architecture
Chapter 4: BigData Use Cases
Chapter 5: BigData – BigBang Queries
Chapter 6: BigData – Best Practice Solutions
Chapter 7: BigData – ToolSets
Chapter 8: BigData – Challenges & Issues
Chapter 9: BigData – Cloud Computing
Chapter 10: BigData – Digital Transformation
Chapter 11: Conclusion
Chapter 12: Citations
Chapter 13: References
About Binayaka Mishra
ACKNOWLEDGMENTS
This paper is for informational purposes only. THIS DOCUMENT IS PROVIDED AS IS
WITH NO WARRANTIES WHATSOEVER, INCLUDING ANY WARRANTY OF MERCHANTABILITY, NONINFRINGEMENT, FITNESS FOR ANY PARTICULAR PURPOSE, OR ANY WARRANTY OTHERWISE ARISING OUT OF ANY PROPOSAL, SPECIFICATION, OR SAMPLE. The author of this paper disclaims all liability, including liability for infringement of any property rights, relating to use of this information. No license, express or implied, by estoppel or otherwise, to any intellectual property rights is granted herein. I would like to personally thanks the various Informational Journals as furnished on Reference
& Citation
to aid me produce such quality content and also let me learn new ideas.
DEDICATION
Salim_BhaiToday, I would like to honour this manuscript to one of my finest mentor as well as coach Mr Salim Raza Khan, for his outstanding contribution to my technical and personal environments. Under his cordial guidance on CapGemini, I was blissed to learn most of the traits of career growth and attitude management. Furthermore in one of the tough timing, when I had a fever of 104 degree Fahrenheit at the time of Production Delivery, which requires me to stay on the official floor for continuous 60 hours, it was him who stood beside to me. Should I still go on mentioning the versatile ways he taught me all the lessons, there shall be no end. I must say, he’s everything for me and shall always stand by my heart.
Chapter 1: Known Facts
The best part to show up anything for marketing could vary from person to person being the retrospective visionary which could require to some exceptional portrayal, but the best part to grab the attention of the audience is to make it gentle while you make any moves with a glimpse of surprise. Should I necessarily define the reason to mention the previous statement? Perhaps yes for most of you to make you realize even if you have it in advance by your instincts. We are about to go back to that time where most of the things were barbaric and people were believing in reigning the winning streak, but it was the same time where evidence said, there were people who were much more creative to provide us so much of assets, which made our life little easier today. Furthermore, once such stuff is Big Data invention as understood in the time of 2000, but its birth was happened way too back in the time of Babylonian civilization.
C 18,000 BCE
The earliest examples we have of humans storing and analysing data are the tally sticks. The Ishango Bone was discovered in 1960 in what is now Uganda and is thought to be one of the earliest pieces of evidence of prehistoric data storage. Palaeolithic tribes people would mark notches into sticks or bones, to keep track of trading activity or supplies. They would compare sticks and notches to carry out rudimentary calculations, enabling them to make predictions such as how long their food supplies would last.
C 2400 BCE
The abacus – the first dedicated device constructed specifically for performing calculations – comes into use in Babylon. The first libraries also appeared around this time, representing our first attempts at mass data storage.
300 BC - 48 AD
The Library of Alexandria is perhaps the largest collection of data in the ancient world, housing up to perhaps half a million scrolls and covering everything we had learned so far, about pretty much everything. Unfortunately, in 48AD it is thought to have been destroyed by the invading Romans, perhaps accidentally. Contrary to common myth, not everything was lost – significant parts of the library’s collections were moved to other buildings in the city, or stolen and dispersed throughout the ancient world.
C 100 – 200 AD
The Antikythera Mechanism, the earliest discovered mechanical computer, is produced, presumably by Greek scientists. It’s CPU
consists of 30 interlocking bronze gears and it is thought to have been designed for astrological purposes and tracking the cycle of Olympic Games. Its design suggests it is probably an evolution of an earlier device – but these so far remain undiscovered.
The 1880 U.S. Census took eight years to tabulate, and it was estimated that the 1890 census would take more than 10 years using the then-available methods. Without any advancement in methodology, tabulation would not have been complete before the 1900 census had to be taken.in 1881, The influx of census data led to the invention of the Hollerith tabulating machine (punch cards), which tamed
the big data and let them complete the job in about a year. It turned Hollerith into an entrepreneur and his company eventually became part of what we know as IBM.
By 1932, Information overload continued with the boom in the US population, the issuing of social security numbers, and the general growth of knowledge (research) which demanded more thorough and organized record-keeping. Libraries, the original source of data organization and storage, had to adapt their storage methods to meet the quickly increasing demand of new publications and research. Scholars began referring to this incredible expansion of information as the information explosion.
First referenced by the Lawton Constitution (newspaper) in 1941, the term was expanded upon in a New Statesman article in March 1964, which referred to the difficulty of managing the amount of information available. The first attempts to quantify the growth rate in the volume of data or what has popularly been known as the information explosion
(a term first used in 1941, according to the Oxford English Dictionary). The first flag of warning on the growth of knowledge as a coming storage and retrieval problem came in 1944, when Fremont Rider, a Wesleyan University librarian, estimated that American University Libraries were doubling in size every sixteen years. Given this growth rate, Rider estimated that the Yale Library in 2040 would have approximately 200,000,000 volumes, which will occupy over 6,000 miles of shelves. Claude Shannon published A Mathematical Theory of Communication which established a framework for determining the minimal data requirements to transmit information over noisy (imperfect) channels. This was a landmark work that enabled much of today’s infrastructure. Without this understanding, data would be
bigger than it is today. It followed Nyquist’s
Certain Factors Affecting Telegraph Speed", which sounds far afield, but Nyquist’s work enabled us to sample analog signals and represent them digitally, which is the foundation of modern data processing.
Figure 1.1: Communication System Protocol
In 1956, The concept of virtual memory was developed by German physicist Fritz-Rudolf Güntsch as an idea that treated finite storage as infinite. Storage, managed by integrated hardware and software to hide the details from the user, permitted us to process data without the hardware memory constraints that previously forced the problem to be partitioned (making the solution a reflection of the hardware architecture, a most unnatural act). In 1958, IBM