Tuesday, December 13, 2016

The power of Hadoop integrated with SAP HANA...

SAP HANA and Hadoop



In the article What is Hadoop? we talked about what exactly is Hadoop, What are its advantages and how it can be best applied. 

Now let's see how Hadoop and HANA can be integrated with each other. 

The power of Hadoop integrated with SAP HANA:

As we understood so far that Hadoop can store very huge amount of data. It is well suited for storing unstructured data, is good for manipulating very large files and is tolerant to hardware and software failures. 
But the main challenge with Hadoop is getting information out of this huge data in real time. 

We also have SAP HANA and as we all know that HANA is well suited for processing data in Real time. Hence SAP HANA and Hadoop is a perfect match. 

To get real time information from massive storage such as Hadoop, we can use HANA and HANA can be directly integrated to Hadoop.
So we can combine Hadoop and HANA to get real time information from huge data. 

With the help of SAP HANA Hadoop Integration we can also combine both structured and un-structured data. Structured and un-structured data are combined and transferred to SAP HANA via a Hadoop / HANA Connector. 

What does Hadoop bring to HANA?

    • Cost efficient data storage and processing for large volumes of structured, semi-structured, and unstructured data such as web logs, machine data, text data, call data records (CDRs), audio, video data.
    • Batch Processing
    • Where fast response times are less critical than reliability and scalability.
    • Complex Information Processing
    • Enable heavily recursive algorithms, machine learning, & queries that cannot be easily expressed in SQL.
    • Low Value Data Archive & Data stays available, though access is slower.
    • Post-hoc Analysis
    • Mine raw data that is either schema-less or where schema changes over time.

SAP HANA and Hadoop Integration:

Hadoop is considered as one of the best in storing the structured, semi-structured and unstructured data. 
Combined structured and un-structured data are transferred to SAP HANA via a Hadoop / HANA Connector. BODS is one of main way to pull data to HANA. 

SAP has also set up a "big-data" partner council, which will work to provide products that make use of HANA and Hadoop. One of the key partners is Cloudera. SAP wants it to be easy to connect to data, whether it's in SAP software or software from another vendor. 
 
SAP Data Services: Simple GUI build and run ETL process