Documentos de Académico
Documentos de Profesional
Documentos de Cultura
SAP FORUM
February 2016
Intel Inside. Powerful Solution Outside.
More information: www.descubrefujitsu.com/SAPforum
FTS INTERNAL
Powered by Intel
Xeon processor.
Rumbo 2020
HANA &
HADOOP
Intro
INDICE
Managed Service Pay per use model for HANA & Hadoop
FTS INTERNAL
Powered by Intel
Xeon processor.
Copyright 2014 FUJITSU LIMITED
Powered by Intel
Xeon processor.
2015 FUJITSU
Complexity
Performance
Enterprise Core
Systems
Unable to work
together
Big Data
Frameworks &
Tools
.
Objetives : Standarize, simplify and Automate both worlds.
Intel Inside. Powerful Solution Outside.
FUJITSU
Powered by Intel
Xeon processor.
2015 FUJITSU
APACHE HADOOP is open source software that enables reliable, scalable, distributed
computing on clusters of inexpensive servers
Powered by Intel
Xeon processor.
2015 FUJITSU
Powered by Intel
Xeon processor.
2015 FUJITSU
Cost efficient data storage and processing for large volumes of structured, semi-structured
and unstructured data such as web logs, machine data, text data, call data records, audio,
video data.
BATCH PROCESSING
Where fast response times are less critical than reliability ad scalability
COMPLEX INFORMATION PROCESSING: Enable heavily recursive algorithms, machine learning &
queries that cannot be easily expressed in SQL
LOW VALUE DATA ARCHIVE: Data stays available, though access is slower. Scale up to Petabytes
POST-HOC ANALYSIS: Mine raw data that is either schema-less or where schema changes over time
Powered by Intel
Xeon processor.
2015 FUJITSU
YAHOO
TWITTER
Twitter uses Hadoop for product
analysis, social graph analysis,
generating indices for people search,
natural language processing and
many other applications
Powered by Intel
Xeon processor.
2015 FUJITSU
SAP HANA
Data Architecture
Data Structures
No predefined schema
Performance
Scalability
Data Consistency
Licensing costs
OLTP
No OLTP
Excellent OLTP
OLAP
Slow OLAP
Excellent OLAP
Server Failover
Small
Excellent
Powered by Intel
Xeon processor.
2015 FUJITSU
Powered by Intel
Xeon processor.
2015 FUJITSU
Connection to HANA
SMART DATA ACCESS ( SDA)
Benefits
Enables access to remote data access just like
local table
Smart query processing including query
decomposition with predicate push-down,
functional compensation
Supports data location agnostic development
No special syntax to access heterogeneous
data sources
Not restricted only to Hadoop
Heterogeneous data sources
Oracle, MS SQL, Teradata, DB2, Netezza
Powered by Intel
Xeon processor.
2015 FUJITSU
Powered by Intel
Xeon processor.
2015 FUJITSU
Spark
APACHE SPARK
Unlike Hadoop, supports batch and steaming Analysis --> Single Framework for
batch and near real time use cases
Spark requires a
1)Cluster Management :standalone, Hadoop YARN, Apache .
2) Distributed Storage System : supports HDFS, Cassandra,
Openstack Swift, Amazon S3 -
If you are going to start with Hadoop now, you should do it with Spark
Intel Inside. Powerful Solution Outside.
FUJITSU
Powered by Intel
Xeon processor.
2015 FUJITSU
HANA Vora is an in-memory query engine which leverages and extends the Apache Spark
execution framework to provide enriched interactive analytics on Hadoop.
Powered by Intel
Xeon processor.
2015 FUJITSU
Powered by Intel
Xeon processor.
2015 FUJITSU
Key Scenarios
Powered by Intel
Xeon processor.
Copyright 2014 FUJITSU LIMITED
Copyright 2014 FUJITSU
2015 LIMITED
FUJITSU
Key Scenarios
Example of Scenarios
Flexible data store Using Hadoop as a flexible store of data captured from multiple sources,
including SAP and non-SAP software, enterprise software, and externally sourced data
Simple database Using Hadoop as a simple database for storing and retrieving data in very large
data sets
Processing engine Using the computation engine in Hadoop to execute business logic or some
other process
Data analytics Mining data held in Hadoop for business intelligence and analytics
Powered by Intel
Xeon processor.
2015 FUJITSU
Powered by Intel
Xeon processor.
2015 FUJITSU
DESCRIPTION
SAMPLE USE
CASES
COMMENT
Social Media
Comments on
products on Twitter,
Facebook, and
Amazon
Data Stream
Capture
Data Archive
Archive Data or
computer systems
logs
OLTP Transaction
Data
Long-term persistence of
transactional data from
historical online transaction
processing (OLTP)
Call center,
inventory..
Powered by Intel
Xeon processor.
2015 FUJITSU
DESCRIPTION
Reference Data
E-mail histories
Fulfillment of legal
requirements for e-mail
persistence and for use in
analytics
Capture of business
documents generated and
received by business.
BLOBS
Powered by Intel
Xeon processor.
2015 FUJITSU
Use Hadoop as a data processing engine for ETL rationalization to feed SAP HANA
Feed results to SAP HANA with Data Services and merge with conformed model
Powered by Intel
Xeon processor.
2015 FUJITSU
DESCRIPTION
ETL Rationalization
Identify differences
DNA Analysis
Hadoop using
Mapreduce
Risk Analysis
Da
Data Mining
COMMENT
Require Mahout
Powered by Intel
Xeon processor.
2015 FUJITSU
Hadoop storage is sometimes so high that cant be replicated into SAP HANA in a cost effective or timely
manner
Analysis will likely require combining data from Hadoop , SAP HANA and other sources
Two approaches:
Two-Phase Analytics : run analysis continually o Hadoop, then periodic updates to SAP HANA for
fast interactive query response
Federated Queries:
Split analysis into parts and run async on Hadoop & SAP HANA
Federate results in SAP HANA or BI
Intel Inside. Powerful Solution Outside.
FUJITSU
Powered by Intel
Xeon processor.
2015 FUJITSU
Powered by Intel
Xeon processor.
2015 FUJITSU
Powered by Intel
Xeon processor.
2015 FUJITSU
Powered by Intel
Xeon processor.
2015 FUJITSU
Powered by Intel
Xeon processor.
2015 FUJITSU
Powered by Intel
Xeon processor.
2015 FUJITSU
Powered by Intel
Xeon processor.
Copyright 2014 FUJITSU LIMITED
Copyright 2014 FUJITSU
2015 LIMITED
FUJITSU
Cualitativos
Cuantitativos
Availability class
99.5%
Managed operations
24 7
Disaster-recovery class
Managed performance
Dialog response
time 90% < 1 sec.
Additional
Certification(s)
ISAE3402 (SOX),
SAS70
Powered by Intel
Xeon processor.
2015 FUJITSU
Las transacciones
representanla utilizacin
real del sistema SAP y
estn vinculadas al negocio
Powered by Intel
Xeon processor.
2015 FUJITSU
Powered by Intel
Xeon processor.
2015 FUJITSU
SERVICIOS INCLUDOS
PAGO MENSUAL EN
FUNCIN DE LA
MEMORIA CONSUMIDA
EN HANA
Powered by Intel
Xeon processor.
2015 FUJITSU
Service Governance
(Service Desk, Service-Management)
Level 5
Level 3
OPENSTACK FRAMEWORK
Level 2
Level 1
Level 4
Powered by Intel
Xeon processor.
2015 FUJITSU
SERVICIOS INCLUDOS
Powered by Intel
Xeon processor.
2015 FUJITSU
Take Aways
Powered by Intel
Xeon processor.
2015 FUJITSU
Summary
TAKE AWAYS
SAP HANA excels at speed and structure, plus is fully integrated with Business Suite Enterprise Logic
Leverage strenghs of both platforms in data store, data processing and analytics scenarios
Carefully evaluate your requirements and use case against these scenarios
If you are about to start with Hadoop, use Apache Spark & Vora
Powered by Intel
Xeon processor.
2015 FUJITSU
Powered by Intel
Xeon processor.
2015 FUJITSU
Rumbo 2020
FTS INTERNAL