
Course Topics

Week 1
Understanding Big Data
Introduction to HDFS
Playing around with the Cluster
Data Loading Techniques

Week 2
Map-Reduce Basics, Types and Formats
Use Cases for Map-Reduce
Analytics using Pig
Understanding Pig Latin

Week 3
Analytics using Hive
Understanding HiveQL
NoSQL Databases
Understanding HBase

Week 4
Zookeeper, Sqoop, Flume
Debug MapReduce Programs in Eclipse
Real-World Datasets and Analysis
Planning a Career in Big Data

What is Big Data?

Facebook Example

Facebook users spend 10.5 billion minutes (almost 20,000 years) online on the social network. An average of 3.2 billion likes and comments are posted on Facebook every day.

Twitter Example
Twitter has over 500 million registered users. The USA's 141.8 million accounts represent 27.4 percent of all Twitter users, well ahead of Brazil, Japan, the UK and Indonesia.
79% of US Twitter users are more likely to recommend brands they follow.
67% of US Twitter users are more likely to buy from brands they follow.
57% of all companies that use social media for business use Twitter.

Other Industrial Usecases


Insurance
Healthcare
Genome Sequencing
Utilities

Hadoop Users

http://wiki.apache.org/hadoop/PoweredBy

Data volume is growing exponentially

Estimated Global Data Volume: 2011: 1.8 ZB

2015: 7.9 ZB
The world's information doubles every two years.
Over the next 10 years:
The number of servers worldwide will grow by 10x.
The amount of information managed by enterprise data centers will grow by 50x.
The number of files enterprise data centers handle will grow by 75x.

Source: http://www.emc.com/leadership/programs/digit al-universe.htm, which was based on the 2011 IDC Digital Universe Study

Un-Structured Data is exploding

Why DFS?
Read 1 TB of data:

1 Machine
4 I/O channels, each channel 100 MB/s: 45 minutes

10 Machines
4 I/O channels per machine, each channel 100 MB/s: 4.5 minutes
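The arithmetic behind the slide's numbers can be sketched as below; the figures (1 TB, 4 channels, 100 MB/s per channel) come from the slide, and the slide rounds the single-machine time up to 45 minutes.

```python
# Illustrative sketch: estimate how long reading 1 TB takes when the work
# is spread across machines, using the slide's assumed hardware numbers.

def read_time_minutes(data_tb, machines, channels_per_machine=4, channel_mb_s=100):
    """Time in minutes to read `data_tb` terabytes in parallel."""
    total_mb = data_tb * 1_000_000  # 1 TB = 1,000,000 MB (decimal units)
    aggregate_mb_s = machines * channels_per_machine * channel_mb_s
    return total_mb / aggregate_mb_s / 60

print(read_time_minutes(1, 1))   # 1 machine: ~41.7 minutes (slide rounds to 45)
print(read_time_minutes(1, 10))  # 10 machines: ~4.2 minutes (slide: 4.5)
```

Ten machines give ten times the aggregate I/O bandwidth, so the read time drops by a factor of ten; this is the core motivation for a distributed file system.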

What Is a Distributed File System (DFS)?

What is Hadoop?
Apache Hadoop is a framework that allows for the distributed processing of large data sets across clusters of commodity computers using a simple programming model.

Companies using Hadoop:
- Yahoo
- Google
- Facebook
- Amazon
- AOL
- IBM
- And many more at http://wiki.apache.org/hadoop/PoweredBy

Hadoop Eco-System

Hadoop Core Components:


HDFS (Hadoop Distributed File System): storage
MapReduce: processing
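The MapReduce processing model can be illustrated with the classic word-count example. This is a minimal in-memory sketch of the map, shuffle, and reduce phases only; a real Hadoop job is written against the Hadoop MapReduce API and reads its input from HDFS.

```python
# Minimal sketch of the MapReduce flow (word count), not a real Hadoop job.
from collections import defaultdict

def map_phase(line):
    # Map: emit a (word, 1) pair for every word in one line of input.
    return [(word.lower(), 1) for word in line.split()]

def shuffle(pairs):
    # Shuffle: group all values by key, as the framework does between phases.
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups

def reduce_phase(key, values):
    # Reduce: sum the counts emitted for one word.
    return key, sum(values)

lines = ["big data big ideas", "data everywhere"]
mapped = [pair for line in lines for pair in map_phase(line)]
counts = dict(reduce_phase(k, v) for k, v in shuffle(mapped).items())
print(counts)  # {'big': 2, 'data': 2, 'ideas': 1, 'everywhere': 1}
```

The division of labor mirrors Hadoop's: map tasks run independently on splits of the data (stored in HDFS), and the framework handles grouping before the reduce tasks aggregate the results.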

Any questions? See you in the next class.


Thank you. Sainagaraju vaduka
