Discover millions of ebooks, audiobooks, and so much more with a free trial

Only $11.99/month after trial. Cancel anytime.

BigData Analytics: Solution Or Resolution?
BigData Analytics: Solution Or Resolution?
BigData Analytics: Solution Or Resolution?
Ebook146 pages1 hour

BigData Analytics: Solution Or Resolution?

Rating: 3 out of 5 stars

3/5

()

Read preview

About this ebook

In 1997, NASA researchers Michael Cox and David Ellsworth publish “Application-controlled demand paging for out-of-core visualization” in the Proceedings of the IEEE 8th conference on Visualization. They start the article with “Visualization provides an interesting challenge for computer systems: data sets are generally quite large, taxing the capacities of main memory, local disk, and even remote disk. We call this the problem of big data. When data sets do not fit in main memory (in core), or when they do not fit even on local disk, the most common solution is to acquire more resources.” It is the first article in the ACM digital library to use the term “big data.” Michael Lesk publishes “How much information is there in the world?”? Lesk concludes that “There may be a few thousand petabytes of information all told; and the production of tape and disk will reach that level by the year 2000. So in only a few years, (a) we will be able [to] save everything–no information will have to be thrown out, and (b) the typical piece of information will never be looked at by a human being.”

LanguageEnglish
Release dateJun 26, 2017
ISBN9781370957200
BigData Analytics: Solution Or Resolution?
Author

Binayaka Mishra

Binayaka Mishra is an experienced IT professional, in various tools and technologies like Data Warehousing, BigData Analytics, Cloud Computing, Reporting Analytics & Project Management documentation with 14+ years’of experience. He was Graduated in Computer Science & Engineering from National Institute Of Science & Technology, Berhampur, Odisha, India in 2002.He has worked in several critical roles with MNC’s like Tech Mahindra, Oracle Corporation, Wipro Technology,CapGemini UK,CapGemini India Pvt Ltd, UBS , AoN Hewitt Associates India Pvt Ltd, HMRC -UK and TUI Travel Plc -UK. Apart from technical details, his mastery are into functional domains like Payroll Processing, Tax Calculation, UK NI, BFSI,Telecommunication, Corporate Tax measurements divisions, Investment Banking, Automotive, Asset management , Security and Travel & Tourisim.Currently working as a Solution Architect / Project Manager in Tech Mahindra, India, loves to listen to music, play snooker, Bowling and a desperate swimmer like a shark.More Information could be found about him in his Linkedin Profile : https://www.linkedin.com/in/binayaka-mishra-b09612142/For any comments or advise, please feel free to write to: mishra.binayaka.18005@gmail.com

Read more from Binayaka Mishra

Related to BigData Analytics

Related ebooks

Software Development & Engineering For You

View More

Related articles

Reviews for BigData Analytics

Rating: 3 out of 5 stars
3/5

1 rating0 reviews

What did you think?

Tap to rate

Review must be at least 10 words

    Book preview

    BigData Analytics - Binayaka Mishra

    Big Data Analytics-Solution Or Resolution?

    By Binayaka Mishra

    Copyright 2017, Binayaka Mishra

    Smashwords Edition

    This ebook is licensed for your personal enjoyment only. This ebook may not be re-sold

    or given away to other people.  If you would like to share this book with another person,

    please purchase an additional copy for each recipient.  If you’re reading this book and did

    not purchase it, or it was not purchased for your use only, then please return to your

    favorite ebook retailer and purchase your own copy.  Thank you for respecting the hard

    work of this author.

    Table of CONTENTS

    Acknowledgements

    Dedication

    Chapter 1: Known Facts

    Chapter 2: BigData Implementation

    2.1 Stage 1

    2.2 Stage 2

    2.3 Stage 3

    Chapter 3: BigData Architecture

    Chapter 4: BigData Use Cases

    Chapter 5: BigData – BigBang Queries

    Chapter 6: BigData – Best Practice Solutions

    Chapter 7: BigData – ToolSets

    Chapter 8: BigData – Challenges & Issues

    Chapter 9: BigData – Cloud Computing

    Chapter 10: BigData – Digital Transformation

    Chapter 11: Conclusion

    Chapter 12: Citations

    Chapter 13: References

    About Binayaka Mishra

    ACKNOWLEDGMENTS

    This paper is for informational purposes only. THIS DOCUMENT IS PROVIDED AS IS WITH NO WARRANTIES WHATSOEVER, INCLUDING ANY WARRANTY OF MERCHANTABILITY, NONINFRINGEMENT, FITNESS FOR ANY PARTICULAR PURPOSE, OR ANY WARRANTY OTHERWISE ARISING OUT OF ANY PROPOSAL, SPECIFICATION, OR SAMPLE. The author of this paper disclaims all liability, including liability for infringement of any property rights, relating to use of this information. No license, express or implied, by estoppel or otherwise, to any intellectual property rights is granted herein. I would like to personally thanks the various Informational Journals as furnished on Reference & Citation to aid me produce such quality content and also let me learn new ideas.

    DEDICATION

    Salim_Bhai

    Today, I would like to honour this manuscript to one of my finest mentor as well as coach Mr Salim Raza Khan, for his outstanding contribution to my technical and personal environments. Under his cordial guidance on CapGemini, I was blissed to learn most of the traits of career growth and attitude management. Furthermore in one of the tough timing, when I had a fever of 104 degree Fahrenheit at the time of Production Delivery, which requires me to stay on the official floor for continuous 60 hours, it was him who stood beside to me. Should I still go on mentioning the versatile ways he taught me all the lessons, there shall be no end. I must say, he’s everything for me and shall always stand by my heart.

    Chapter 1: Known Facts

    The best part to show up anything for marketing could vary from person to person being the retrospective visionary which could require to some exceptional portrayal, but the best part to grab the attention of the audience is to make it gentle while you make any moves with a glimpse of surprise. Should I necessarily define the reason to mention the previous statement? Perhaps yes for most of you to make you realize even if you have it in advance by your instincts. We are about to go back to that time where most of the things were barbaric and people were believing in reigning the winning streak, but it was the same time where evidence said, there were people who were much more creative to provide us so much of assets, which made our life little easier today. Furthermore, once such stuff is Big Data invention as understood in the time of 2000, but its birth was happened way too back in the time of Babylonian civilization.

    C 18,000 BCE

    The earliest examples we have of humans storing and analysing data are the tally sticks. The Ishango Bone was discovered in 1960 in what is now Uganda and is thought to be one of the earliest pieces of evidence of prehistoric data storage. Palaeolithic tribes people would mark notches into sticks or bones, to keep track of trading activity or supplies. They would compare sticks and notches to carry out rudimentary calculations, enabling them to make predictions such as how long their food supplies would last.

    C 2400 BCE

    The abacus – the first dedicated device constructed specifically for performing calculations – comes into use in Babylon. The first libraries also appeared around this time, representing our first attempts at mass data storage.

    300 BC - 48 AD

    The Library of Alexandria is perhaps the largest collection of data in the ancient world, housing up to perhaps half a million scrolls and covering everything we had learned so far, about pretty much everything. Unfortunately, in 48AD it is thought to have been destroyed by the invading Romans, perhaps accidentally. Contrary to common myth, not everything was lost – significant parts of the library’s collections were moved to other buildings in the city, or stolen and dispersed throughout the ancient world.

    C 100 – 200 AD

    The Antikythera Mechanism, the earliest discovered mechanical computer, is produced, presumably by Greek scientists. It’s CPU consists of 30 interlocking bronze gears and it is thought to have been designed for astrological purposes and tracking the cycle of Olympic Games. Its design suggests it is probably an evolution of an earlier device – but these so far remain undiscovered.

    The 1880 U.S. Census took eight years to tabulate, and it was estimated that the 1890 census would take more than 10 years using the then-available methods. Without any advancement in methodology, tabulation would not have been complete before the 1900 census had to be taken.in 1881, The influx of census data led to the invention of the Hollerith tabulating machine (punch cards), which tamed the big data and let them complete the job in about a year. It turned Hollerith into an entrepreneur and his company eventually became part of what we know as IBM.

    By 1932, Information overload continued with the boom in the US population, the issuing of social security numbers, and the general growth of knowledge (research) which demanded more thorough and organized record-keeping. Libraries, the original source of data organization and storage, had to adapt their storage methods to meet the quickly increasing demand of new publications and research. Scholars began referring to this incredible expansion of information as the information explosion. First referenced by the Lawton Constitution (newspaper) in 1941, the term was expanded upon in a New Statesman article in March 1964, which referred to the difficulty of managing the amount of information available. The first attempts to quantify the growth rate in the volume of data or what has popularly been known as the information explosion (a term first used in 1941, according to the Oxford English Dictionary). The first flag of warning on the growth of knowledge as a coming storage and retrieval problem came in 1944, when Fremont Rider, a Wesleyan University librarian, estimated that American University Libraries were doubling in size every sixteen years. Given this growth rate, Rider estimated that the Yale Library in 2040 would have approximately 200,000,000 volumes, which will occupy over 6,000 miles of shelves. Claude Shannon published A Mathematical Theory of Communication which established a framework for determining the minimal data requirements to transmit information over noisy (imperfect) channels. This was a landmark work that enabled much of today’s infrastructure. Without this understanding, data would be bigger than it is today. It followed Nyquist’s Certain Factors Affecting Telegraph Speed", which sounds far afield, but Nyquist’s work enabled us to sample analog signals and represent them digitally, which is the foundation of modern data processing.

    Figure 1.1: Communication System Protocol

    In 1956, The concept of virtual memory was developed by German physicist Fritz-Rudolf Güntsch as an idea that treated finite storage as infinite. Storage, managed by integrated hardware and software to hide the details from the user, permitted us to process data without the hardware memory constraints that previously forced the problem to be partitioned (making the solution a reflection of the hardware architecture, a most unnatural act). In 1958, IBM

    Enjoying the preview?
    Page 1 of 1