Part 1: The current facts
I've been receiving a lot this question: when I have SAP HANA either with storage or HANA System Replication in a synchronous mode, depending on the network latency, it will not meet the TDI KPIs. Will SAP accept this? What would be the performance impact at an application level?
The question came as this customer wanted all the performance he could get, but also the maximum data resiliency technology could provide today.
In a nutshell, there are certain situations that the "Laws of Physics" do not allow you to have all you want, and you need to chose: either the maximum performance, or the maximum data resiliency.
Where did I get the information that it is acceptable to fail KPIs?
Then, I'll also share the reasoning that followed after getting this question cleared up, by providing a concrete customer example I hope helps you build your own reasoning in case you're going through a similar discussion in your organization as well.
Network latency impact on SAP HANA performance when replicating synchronously
There are two in particular relevant to this matter: The SAP HANA Network Requirements whitepaper: http://scn.sap.com/docs/DOC-63221
In page 28 it says:
- There is no straightforward recommendation regarding the bandwidth and the latency a system replication network must provide. A rough estimation of the required performance is given in the How-to Guide Network Required for SAP HANA System Replication, which is referred to from the SAP Note 1999880 - FAQ: SAP HANA system replication
- Latency: The redo log shipping time for 4 KB log buffers must be less than a millisecond or in a low single-digit millisecond range – depending on the application requirements (relevant for synchronous replication only).
Transactional scenarios (like SAP ERP on HANA) are more sensitive to latency. Analytical scenarios are more sensitive to throughput.
Here, the same as above is mentioned:
- All changes to data are captured in the redo log. The SAP HANA database asynchronously persists the redo log with I/O orders of 4 KB to 1 MB size into log segment files in the log volume (i. e. on disk). A transaction writing a commit into the redo log waits until the buffer containing the commit has been written to the log volume. This wait time for 4 KB log buffers should be less than a millisecond or in a low single-digit millisecond range.
Even without replication, it is your choice whether is acceptable to fail KPIs
- This is always up to the customer to decide if falling below the KPIs is acceptable for his/her daily operation of the SAP HANA system. The customer must decide whether the performance of his/her SAP HANA system is sufficient for his/her needs.
- The following questions and examples might help for making that decision:
- Is the given SAP HANA system a non-productive system?
- The KPIs apply for production systems only. In general, for non-productive systems weaker performance is acceptable
- Which SAP HANA scenarios are run on the given system?
- OLAP scenarios only (e.g. SAP BW-on-HANA):
- The performance of queries is usually not affected if all required tables have been loaded into memory
- Latency times of the log volume and throughput rates for writing the data volume are mainly relevant when loading data from source systems
- Throughput rates for reading from the data or the log volume affect the overall system restart time (e.g. after applying an SAP HANA revision update)
- OLTP scenarios only (e.g. SAP Business-Suite-on-HANA):
- Latency times of the log volume affect the duration of every transaction that changes one or more tables in the database
- Throughput rates for reading from the data or the log volume affect the overall system restart time (e.g. after applying an SAP HANA revision update)
- See the Storage Requirements whitepaper for details: SAP HANA TDI - Storage Requirements | SCN
- OLAP scenarios only (e.g. SAP BW-on-HANA):
- Is the given SAP HANA system a non-productive system?
No comments:
Post a Comment