DanielIngenhaag
Product and Topic Expert
This is the second part of my blog series about SAP Data Intelligence ABAP integration with generation 2 operators. After covering the theory, including prerequisites and concepts for generation 2 ABAP integration, in my blog post part 1 – Overview and Introduction, I will now build a real scenario in SAP Data Intelligence, once as a graph with generation 1 operators and once with generation 2 operators. The scenario extracts data from a custom CDS view located in an SAP S/4HANA 2020 (on-premise) system and loads it into CSV files in a Google Cloud Storage (GCS) target, using both initial load and delta load capabilities.

To get started, we first create the connections required for this scenario.

SAP Data Intelligence – Create Connections to the SAP S/4HANA Source System & Google Cloud Storage:


First of all, we need to define the required connections in the Connection Management application pointing to SAP S/4HANA and Google Cloud Storage:

Connection to SAP S/4HANA 2020 using Connection type ABAP and RFC:

 



 

Note: In this case, the SAP S/4HANA 2020 system has been equipped with all necessary SAP Notes, including TCI Note 3105880.

 

Connection to Google Cloud Storage (GCS):




 

CDS view in the source SAP S/4HANA system:

The sample scenario in this blog uses an SAP S/4HANA system (on-premise) and a custom CDS view called “ZCDS_EPM_PO”, which was created based on the EPM data model that ships with every SAP S/4HANA system. The CDS view is built on table SEPM_BPA (Business Partners) and carries the analytics annotations required for data extraction and delta extraction, as described here: https://help.sap.com/viewer/3a65df0ce7cd40d3a61225b7d3c86703/Cloud/en-US/55b2a17f987744cba62903e97dd...


More information about the EPM data model can be found here: https://help.sap.com/viewer/a602ff71a47c441bb3000504ec938fea/202009.latest/en-US/57e9f59831e94311852...

 


SAP Data Intelligence – Create and Execute a Generation 1 Graph


The design of the graph is quite simple and looks as follows:


 

1. Define CDS view extraction
Configure the CDS Reader operator (here V2 is used) to extract data from CDS view “ZCDS_EPM_PO”:

  • Connection: “S4H_2020”

  • Version: ABAP CDS Reader V2

  • Subscription type: New

  • Subscription name: ZCDS_EPM_PO_01

  • ABAP CDS Name: ZCDS_EPM_PO

  • Transfer Mode: Replication

  • Records per Roundtrip: 5000

  • ABAP Data Type Conversion*: Enhanced Format Conversion (default)


* This parameter has been introduced recently, and its availability in the operator configuration panel depends on your SAP source system. If your system does not have the latest SAP Notes and the TCI Note, this parameter might not be visible to you.




2. Get CDS view columns


In the second step, a small Python operator extracts the CDS view columns from the header of the CDS Reader output message and generates a CSV stream that contains the column names in the first row, followed by the actual data starting in the second row. For this purpose, one input port and one output port, both of type message, are used.

There are various ways to achieve this; you could, for example, use the approach provided in a blog post by Yuliya Reich: https://blogs.sap.com/2021/02/15/sap-data-intelligence-how-to-get-headers-of-cds-view/. Note that the code might need to be adjusted depending on your individual use case and the SAP S/4HANA version you are using.

Code snippet based on Python 3.6:
from io import StringIO
import csv
import pandas as pd
import json

def on_input(inData):
    # read body
    data = StringIO(inData.body)

    # read attributes
    var = json.dumps(inData.attributes)
    result = json.loads(var)

    # from here we start json parsing
    ABAP = result['ABAP']
    Fields = ABAP['Fields']

    # creating an empty list for headers
    columns = []
    for item in Fields:
        columns.append(item['Name'])

    # data mapping using headers & saving into pandas dataframe
    df = pd.read_csv(data, index_col=False, names=columns)

    # here you can prepare your data,
    # e.g. columns selection or records filtering

    df_csv = df.to_csv(index=False, header=True)
    api.send('outString', df_csv)

api.set_port_callback('inData', on_input)
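
Depending on your source system release and the transfer mode, individual batches may arrive without the ABAP metadata block in the message attributes (for example the final batch of an initial load, as discussed in the comments below), in which case the snippet above fails on the 'Fields' lookup. The following is a minimal defensive variant of the same operator, a sketch only, that simply forwards such batches unchanged:

from io import StringIO
import pandas as pd
import json

def on_input(inData):
    # parse the message attributes; fall back gracefully if the ABAP metadata block is missing
    result = json.loads(json.dumps(inData.attributes))
    fields = result.get('ABAP', {}).get('Fields', [])

    if not fields:
        # no column metadata in this batch: pass the body through unchanged
        api.send('outString', inData.body)
        return

    # same logic as above: use the field names as the CSV header
    columns = [item['Name'] for item in fields]
    df = pd.read_csv(StringIO(inData.body), index_col=False, names=columns)
    api.send('outString', df.to_csv(index=False, header=True))

api.set_port_callback('inData', on_input)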


3. Write CSV files into Google Cloud Storage


As a last step, the data needs to be loaded into Google Cloud Storage as CSV files. As a prerequisite, the “to File” operator needs to be used as a conversion operator to bring the data into the format required by the Write File operator.

In the Write File Operator configuration panel, we define the following parameters:

  • Connection: GCS

  • Path mode: Static with Placeholders

  • Path: /demo/Generation1/ZCDS_EPM_PO/<date>/PO_<time>_<counter>.csv

  • Mode: Create only

  • If file exists: Fail

  • Join Batches: False



In this example, the file path “/demo/Generation1/ZCDS_EPM_PO/<date>/PO_<time>_<counter>.csv” uses the placeholders date, time and counter, which are resolved at graph runtime whenever a data package leaves the CDS Reader operator and is written to GCS. Each incoming data package is stored in a folder for the current day, and each file name consists of the prefix “PO_”, followed by the time at which the data set arrived in the Write File operator and an integer counter that starts at 1 and is incremented by 1 for each data set streaming through the graph.

Example: /demo/Generation1/ZCDS_EPM_PO/20220103/PO_071053_1.csv

Please note that with the “Create Only” file mode, the placeholders in our example ensure that every chunk of data is written into its own dedicated file. If you use a plain path without placeholders, only the first chunk of data will be written to GCS. Technically you can also use the “Append” file mode, but depending on your scenario, appending everything into large files might lead to memory problems.

There are of course multiple other ways of designing the file name pattern based on your specific needs. You could, for example, create your own header information in Python and hand it over to the Write File operator as an input, as sketched below.
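
The following is a minimal sketch of such a generation 1 Python3 operator that builds a custom file name per batch and passes it along in the message attributes. The attribute key “file.path” is only an assumption for illustration; check which path mode and header attribute your Write File operator configuration actually evaluates.

from datetime import datetime, timezone

counter = 0

def on_input(msg):
    global counter
    counter += 1
    now = datetime.now(timezone.utc)
    # e.g. /demo/Generation1/ZCDS_EPM_PO/20220103/PO_071053_1.csv
    file_name = "/demo/Generation1/ZCDS_EPM_PO/{}/PO_{}_{}.csv".format(
        now.strftime("%Y%m%d"), now.strftime("%H%M%S"), counter)

    # copy the incoming attributes and add the custom file name
    attributes = dict(msg.attributes)
    attributes["file.path"] = file_name  # assumed attribute key, adjust to your setup
    api.send("outData", api.Message(msg.body, attributes))

api.set_port_callback("inData", on_input)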

As we expect to run the graph 24x7 for the initial load and change data capture, we do not add a Graph Terminator at the end of the graph.

When we now execute the graph, we can use the Metadata Explorer to see the following CSV files being created in the Google Cloud Storage target:

Once the initial load is done, one file is generated per incoming delta that occurred in the source. This can lead to small files being generated if the number of changes in your source system is low. Here, too, you could of course build mechanisms to merge batches into bigger chunks, but in this case we want to keep it simple.

 

SAP Data Intelligence – Create and Execute a Generation 2 Graph:


Now we want to implement the same use case, loading data from a CDS view into CSV files in Google Cloud Storage, using the generation 2 operators. The high-level design of the generation 2 graph looks as follows:


 

1. Define CDS view extraction
In the first part we configure the Read Data from SAP System operator to extract data from CDS view “ZCDS_EPM_PO” using the following configuration parameters:

  • Connection
    The Connection of your SAP system created in the Connection Management containing all the necessary prerequisites mentioned in the beginning of this blog post.

  • Version
    Select the version of the Read Data from SAP System operator. Please note that currently only one version of this operator is available.

  • Object Name
    Browse and select the data set (in this case CDS view ZCDS_EPM_PO) you want to extract. Depending on your connected SAP system, you will either be able to browse within the CDS folder (SAP S/4HANA Cloud or on-premise) or within the SLT folder (SAP Business Suite, SAP S/4HANA Foundation and SLT systems).



  • Transfer Mode
    Choose between Initial Load, Replication and Delta Load transfer.


 

In our example scenario, we will select the following configurations:

  • Select the connection to SAP S/4HANA, here “S4H_2020”



 

  • Browse and select the operator version in the “Version” parameter



 

  • Browse the “Object Name” parameter to select the “ZCDS_EPM_PO” CDS view:


 

Please note that you should not manually change the displayed value in the Object Name parameter once you have selected your data set.

  • Select Transfer Mode: “Replication”



 

2. Generate a binary stream in CSV format
Drag and drop the “Table to Binary” operator onto the modeling canvas and connect the output of the Read Data from SAP System operator to the input of the “Table to Binary” operator. The Table to Binary standard operator is required because the Binary File Producer operator expects a binary input stream. It takes the table-based input from the Read Data from SAP System operator and offers different formats in which the data can be further processed, e.g. CSV, JSON, JSON Lines or Parquet. In our case we select CSV as the output format and the following CSV properties:


 

Note: The “colNames” parameter can be left empty, as the column names are extracted automatically.
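
For illustration only (plain Python with made-up sample values, not operator code): the difference between two of the serialization formats the Table to Binary operator offers, CSV with a header row derived from the column names versus JSON Lines with one JSON object per record, looks roughly like this:

import csv, io, json

rows = [
    {"BP_ID": "0100000000", "COMPANY_NAME": "SAP"},
    {"BP_ID": "0100000001", "COMPANY_NAME": "Becker Berlin"},
]

# CSV: one header line with the column names, followed by the data rows
buf = io.StringIO()
writer = csv.DictWriter(buf, fieldnames=list(rows[0].keys()))
writer.writeheader()
writer.writerows(rows)
print(buf.getvalue())

# JSON Lines: one JSON object per row, no separate header
print("\n".join(json.dumps(r) for r in rows))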


 

3. Write Partitioned CSV files to Google Cloud Storage

Drag and drop the generation 2 “Binary File Producer” operator to load the data as CSV files into Google Cloud Storage. Connect the output port “binary” of the Table to Binary operator to the input port of the Binary File Producer operator.


 

In the Binary File Producer operator configuration panel, we define the following parameters:

  • Connection ID: GCS

  • Path mode: Static with Placeholders

  • Path: /demo/Generation2/ZCDS_EPM_PO.csv

  • Mode: Create only

  • File mode: Partitioned File (required for using snapshots and resilience)

  • Error Handling: terminate on error



In this example, the partitioned file mode has been selected, which is a necessary prerequisite if we want to use the graph resilience feature with snapshots with this operator. As the file path parameter I have chosen “/demo/Generation2/ZCDS_EPM_PO.csv”.

The main difference compared to the generation 1 graph is that we now have a file mode parameter where we select “partitioned file” mode, which is required for using the snapshot feature. When partitioned file mode is selected, we can currently only define the path up to the partitioned file folder, in this example “ZCDS_EPM_PO.csv”, but not the actual names of the various part files that are generated inside that folder. The file name pattern is fixed and uses a generated UUID as well as the batch index included in the header information. More information about the handling of files, including file name patterns, can be found here: https://help.sap.com/viewer/97fce0b6d93e490fadec7e7021e9016e/Cloud/en-US/64d01ad02e39499594b1eb10397...

There are of course multiple other ways to store your data in your file storage target system, e.g. using placeholders in the file path similar to the generation 1 Write File operator, but in this case we want to keep it simple and do without placeholders.

As we expect to run the graph 24x7 for the initial load and change data capture, we do not use a Graph Terminator at the end of the graph.

 

Spoiler Alert: Usage of last batch and Graph Terminator in initial load mode

With generation 2 graphs and operators there is also an improvement when using, for example, the Initial Load mode in the Read Data from SAP System operator: the “isLast” batch attribute in the header indicates whether the last chunk of data has been loaded from the SAP source system.

The Graph Terminator operator has a configuration parameter “On Last Batch”. If we change our graph to perform an “Initial Load” in the Transfer Mode parameter, add a Graph Terminator after the Binary File Producer and set “On Last Batch” to true, the graph will automatically switch to the completed state once the load is finished:


Furthermore, you could also combine the Read Data from SAP System operator with structured data operators such as the Data Transform operator, e.g. to perform a projection on the CDS view. This was not possible with the generation 1 operators due to the different port types used by the CDS Reader and Data Transform operators:


But now it is time to execute our graph (using Replication mode in the Read Data from SAP System operator).

To execute the graph with the resilience feature and snapshots enabled, we click the “Run As” button in the Modeler:



In the pop-up dialog, you can specify different options:

  • Run Graph As: Specify the name of the graph execution that will be shown in the Modeler
    e.g.: Load Purchase Order to Google Cloud Storage.

  • Snapshot Configuration: Enable
    Every 30 seconds (default value). Note: Currently, when using the Read Data from SAP System operator, snapshots are mandatory and need to be enabled.



  • Recovery Configuration: Enable
    Retry for: “2” runs within the threshold value of “30” seconds.



Note:
This parameter specifies whether the optional automatic restart feature should be used. Whenever a graph fails, this feature restarts it automatically, up to the maximum number of retries specified in “Retry for”, within the time interval specified in the “threshold” parameter.


During startup, you can open the Process Logs tab of your running graph and check out some new metrics provided by the generation 2 Read Data from SAP System operator, e.g.:

  • The completion rate of the access plan calculation in the source ABAP system (“calculation phase”)

  • The completion rate of the initial load phase (“loading phase”)

  • Begin of the replication / delta load phase

  • The work process number, which you can use as a reference in the SAP ABAP backend system (“work process number”)




 

In addition, a new metric for the number of processed records has been introduced for the generation 2 Read Data from SAP System operator:

 


 

After the graph status switches to “running”, we can open the Metadata Explorer application to check the files being transferred to Google Cloud Storage.

When we browse to the file path we defined, “/demo/Generation2/ZCDS_EPM_PO.csv”, we can see that the following directory has been created:


Inside the directory we can now see various part files that were replicated, where each part-file represents one batch of CDS view records. The generated part files follow the naming pattern

“part-<uuid>-<batch_index>.<file_extension>”,

e.g.: part-ffbc63eb-c582-4f90-b3c6-dbeb362e9977-0.csv


We can now also perform a quick data preview for one of the files to check the actual data that has been replicated from SAP S/4HANA to Google Cloud Storage:


Once the initial load is done, one part file is generated per incoming delta that occurred in the source. This can lead to small files being generated if the change rate of the data in the source system is low. You can of course add mechanisms to merge batches into bigger chunks, for example in a post-processing step as sketched below.
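
As a minimal sketch of such a post-processing step (outside of SAP Data Intelligence, assuming the part files of this example have been downloaded locally, that each part file contains a header row, and using pandas for the merge), the part files could be combined into a single CSV ordered by their batch index:

import glob
import os
import re
import pandas as pd

# batch index is encoded at the end of the name: "part-<uuid>-<batch_index>.csv"
PART_PATTERN = re.compile(r"^part-.+-(\d+)\.csv$")

def batch_index(path):
    match = PART_PATTERN.match(os.path.basename(path))
    if not match:
        raise ValueError("Unexpected part file name: " + path)
    return int(match.group(1))

# collect and sort the downloaded part files by batch index
part_files = sorted(glob.glob("ZCDS_EPM_PO.csv/part-*.csv"), key=batch_index)

# concatenate all batches and write one merged CSV file
merged = pd.concat((pd.read_csv(f) for f in part_files), ignore_index=True)
merged.to_csv("ZCDS_EPM_PO_merged.csv", index=False)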

With our current configuration of snapshots and auto-restart, the graph will automatically restart in case of an error and continue the replication using the snapshot information captured at the point of failure.

If an error occurs, you will see in the lower section of the Modeler that a new instance of the graph is started while the failed one still exists. Both executions belong to and are grouped under the main graph we started:


Before the error occurred, we can see in the Metadata Explorer that CSV files up to batch index 5 had been generated in Google Cloud Storage:


After the graph successfully restarts, the replication continues with batch index 6:


In the overview section of your running or failed graph, you can also check the values you defined upon graph execution as well as the failure count etc.:



Conclusion


That’s it. I hope I could give you some useful information about ABAP integration with SAP Data Intelligence using generation 2 graphs, based on a sample scenario.

Please feel free to add your feedback and questions in the comment section of this blog post.
27 Comments
tim_huse
Advisor
Very helpful blog, Daniel 🙂
guest9
Advisor
Thanks for sharing Daniel. Very simple steps.
manojketha
Explorer
Very helpful information, Daniel! Thank you for sharing this.
bharathkote
Explorer
Getting an error (mentioned below) while using the generation 1 pipeline

 

Group messages: Group: default; Messages: Graph failure: operator.com.sap.system.python3Operator:python3operator1: Error while executing callback registered on port(s) ['inData']: 'Fields' [line 16] Process(es) terminated with error(s). restartOnFailure==false

 

 
jimgiffin
Product and Topic Expert
Depending on your source system and SP levels, sometimes the metadata comes out in a slightly different format. The error is due to the Python code looking for the variable Fields, which comes from this bit in the Python code:
# from here we start json parsing
ABAP = result['ABAP']
Fields = ABAP['Fields']

Put a wiretap in front of the Python and inspect the metadata coming out from your SAP system and adjust the python variables to point at the correct json string (not ABAP.Fields)

 
rajarshi_muhuri
Active Participant
Is there a way we could have the delta via batch instead of the graph running continuously?
msurge
Explorer
Very helpful Daniel, thanks. One question: is the data contract maintained going from generation 1 to generation 2? A better question might be: how do we maintain the data contract going from gen 1 to gen 2? Thanks
former_member802271
Discoverer
Hi Daniel,

Useful blog.

Any advice on big volumes?

We are trying to extract ACDOCA via a CDS view, and the system doesn't seem to be fast enough with 90 million and 600 million records.

Thanks in advance
ravicondamoor
Advisor
Hi,

Is it possible to replicate parameterized CDS views via RMS or Gen 2 operators?

Thanks

-ravi
DanielIngenhaag
Product and Topic Expert
Hi Ravi,

please check the following SAP note: https://launchpad.support.sap.com/#/notes/2890171

Kind regards,

Daniel
DanielIngenhaag
Product and Topic Expert
Hi Frank,

if you want to perform 1:1 data replication with minor projections or filters, I would recommend checking out the Replication Flow functionality instead of using pipelines.

Kind regards,

Daniel
DanielIngenhaag
Product and Topic Expert
Hi Rajarshi,

You may want to try out the pause / suspend & resume mechanisms provided by generation 2 pipelines to achieve your goal.

Kind regards,

Daniel
baldiway
Explorer
Hello James,

I got the same error

File "/vrep/vflow/subdevkits/pysix_subengine/pysix_subengine/callbacks.py", line 321, in __run_single_port_callback_monitor self.__callback(data) File "<string>", line 16, in on_input KeyError: 'Fields' 'Fields' [line 16]

and I debugged the flow and the attributes corresponding to ABAP.Fields. I tried some changes in the code but without success. Every time, Python complains about the use of ['Fields'].

Do you have another clue? Note that the process finishes with the file generated OK, but the execution status remains dead...

dead 5 minutes ago..

Thanks in Advanced.

 

{"ABAP":{"Fields":[{"Decimals":0,"Kind":"C","Length":4,"Name":"BUKRS","Type":"BUKRS"},{"Decimals":0,"Kind":"C","Length":10,"Name":"HKONT","Type":"HKONT"},{"Decimals":0,"Kind":"D","Length":16,"Name":"AUGDT","Type":"AUGDT"},{"Decimals":0,"Kind":"C","Length":10,"Name":"AUGBL","Type":"AUGBL"},{"Decimals":0,"Kind":"C","Length":18,"Name":"ZUONR","Type":"DZUONR"},{"Decimals":0,"Kind":"N","Length":4,"Name":"GJAHR","Type":"GJAHR"},{"Decimals":0,"Kind":"C","Length":10,"Name":"BELNR","Type":"BELNR_D"},{"Decimals":0,"Kind":"N","Length":3,"Name":"BUZEI","Type":"BUZEI"},{"Decimals":0,"Kind":"D","Length":16,"Name":"BUDAT","Type":"BUDAT"},{"Decimals":0,"Kind":"D","Length":16,"Name":"BLDAT","Type":"BLDAT"},{"Decimals":0,"Kind":"C","Length":5,"Name":"WAERS","Type":"WAERS"},{"Decimals":0,"Kind":"C","Length":16,"Name":"XBLNR","Type":"XBLNR1"},{"Decimals":0,"Kind":"C","Length":2,"Name":"BLART","Type":"BLART"},{"Decimals":0,"Kind":"N","Length":2,"Name":"MONAT","Type":"MONAT"},{"Decimals":0,"Kind":"C","Length":2,"Name":"BSCHL","Type":"BSCHL"},{"Decimals":0,"Kind":"C","Length":1,"Name":"SHKZG","Type":"SHKZG"},{"Decimals":0,"Kind":"C","Length":4,"Name":"GSBER","Type":"GSBER"},{"Decimals":0,"Kind":"C","Length":3,"Name":"TAX_COUNTRY","Type":"FOT_TAX_COUNTRY"},{"Decimals":0,"Kind":"C","Length":2,"Name":"MWSKZ","Type":"MWSKZ"},{"Decimals":0,"Kind":"D","Length":16,"Name":"TXDAT_FROM","Type":"FOT_TXDAT_FROM"},{"Decimals":0,"Kind":"N","Length":3,"Name":"FKONT","Type":"FIPLS"},{"Decimals":2,"Kind":"P","Length":12,"Name":"DMBTR","Type":"DMBTR"},{"Decimals":2,"Kind":"P","Length":12,"Name":"WRBTR","Type":"WRBTR"},{"Decimals":2,"Kind":"P","Length":12,"Name":"MWSTS","Type":"MWSTS"},{"Decimals":2,"Kind":"P","Length":12,"Name":"WMWST","Type":"WMWST"},{"Decimals":0,"Kind":"C","Length":50,"Name":"SGTXT","Type":"SGTXT"},{"Decimals":0,"Kind":"C","Length":16,"Name":"PROJN","Type":"PROJN"},{"Decimals":0,"Kind":"C","Length":12,"Name":"AUFNR","Type":"AUFNR_NEU"},{"Decimals":0,"Kind":"C","Length":4,"Name":"WERKS","Type":"WERKS_D"},{"Decimals":0,"Kind":"C","Length":10,"Name":"KOSTL","Type":"KOSTL"},{"Decimals":0,"Kind":"D","Length":16,"Name":"ZFBDT","Type":"DZFBDT"},{"Decimals":0,"Kind":"C","Length":1,"Name":"XOPVW","Type":"XOPVW"},{"Decimals":0,"Kind":"D","Length":16,"Name":"VALUT","Type":"VALUT"},{"Decimals":0,"Kind":"C","Length":1,"Name":"BSTAT","Type":"BSTAT_D"},{"Decimals":2,"Kind":"P","Length":12,"Name":"BDIFF","Type":"BDIFF"},{"Decimals":2,"Kind":"P","Length":12,"Name":"BDIF2","Type":"BDIF2"},{"Decimals":0,"Kind":"C","Length":6,"Name":"VBUND","Type":"RASSC"},{"Decimals":0,"Kind":"C","Length":5,"Name":"PSWSL","Type":"PSWSL"},{"Decimals":0,"Kind":"C","Length":1,"Name":"WVERW","Type":"WVERW"},{"Decimals":2,"Kind":"P","Length":12,"Name":"DMBE2","Type":"DMBE2"},{"Decimals":2,"Kind":"P","Length":12,"Name":"DMBE3","Type":"DMBE3"},{"Decimals":2,"Kind":"P","Length":12,"Name":"MWST2","Type":"MWST2"},{"Decimals":2,"Kind":"P","Length":12,"Name":"MWST3","Type":"MWST3"},{"Decimals":2,"Kind":"P","Length":12,"Name":"BDIF3","Type":"BDIF3"},{"Decimals":2,"Kind":"P","Length":12,"Name":"RDIF3","Type":"RDIF3"},{"Decimals":0,"Kind":"C","Length":1,"Name":"XRAGL","Type":"XRAGL"},{"Decimals":0,"Kind":"N","Length":8,"Name":"PROJK","Type":"PS_PSP_PNR"},{"Decimals":0,"Kind":"C","Length":10,"Name":"PRCTR","Type":"PRCTR"},{"Decimals":0,"Kind":"C","Length":1,"Name":"XSTOV","Type":"XSTOV"},{"Decimals":0,"Kind":"C","Length":1,"Name":"XARCH","Type":"XARCH"},{"Decimals":2,"Kind":"P","Length":12,"Name":"PSWBT","Type":"PSWBT"},{"Decimals":0,"Kind":"C","Length":1,"Name"
:"XNEGP","Type":"XNEGP"},{"Decimals":0,"Kind":"N","Length":3,"Name":"RFZEI","Type":"RFZEI_CC"},{"Decimals":0,"Kind":"C","Length":10,"Name":"CCBTC","Type":"CCBTC"},{"Decimals":0,"Kind":"C","Length":20,"Name":"XREF3","Type":"XREF3"},{"Decimals":0,"Kind":"C","Length":4,"Name":"BUPLA","Type":"BUPLA"},{"Decimals":2,"Kind":"P","Length":12,"Name":"PPDIFF","Type":"PPDIFF"},{"Decimals":2,"Kind":"P","Length":12,"Name":"PPDIF2","Type":"PPDIF2"},{"Decimals":2,"Kind":"P","Length":12,"Name":"PPDIF3","Type":"PPDIF3"},{"Decimals":0,"Kind":"C","Length":3,"Name":"BEWAR","Type":"RMVCT"},{"Decimals":0,"Kind":"C","Length":8,"Name":"IMKEY","Type":"IMKEY"},{"Decimals":0,"Kind":"D","Length":16,"Name":"DABRZ","Type":"DABRBEZ"},{"Decimals":0,"Kind":"C","Length":13,"Name":"INTRENO","Type":"VVINTRENO"},{"Decimals":0,"Kind":"C","Length":20,"Name":"GRANT_NBR","Type":"GM_GRANT_NBR"},{"Decimals":0,"Kind":"C","Length":16,"Name":"FKBER","Type":"FKBER"},{"Decimals":0,"Kind":"C","Length":14,"Name":"FIPOS","Type":"FIPOS"},{"Decimals":0,"Kind":"C","Length":16,"Name":"FISTL","Type":"FISTL"},{"Decimals":0,"Kind":"C","Length":10,"Name":"GEBER","Type":"BP_GEBER"},{"Decimals":0,"Kind":"C","Length":10,"Name":"PPRCT","Type":"PPRCTR"},{"Decimals":0,"Kind":"C","Length":1,"Name":"BUZID","Type":"BUZID"},{"Decimals":0,"Kind":"N","Length":4,"Name":"AUGGJ","Type":"AUGGJ"},{"Decimals":0,"Kind":"C","Length":2,"Name":"UZAWE","Type":"UZAWE"},{"Decimals":0,"Kind":"C","Length":10,"Name":"SEGMENT","Type":"FB_SEGMENT"},{"Decimals":0,"Kind":"C","Length":10,"Name":"PSEGMENT","Type":"FB_PSEGMENT"},{"Decimals":0,"Kind":"C","Length":10,"Name":"PGEBER","Type":"FM_PFUND"},{"Decimals":0,"Kind":"C","Length":20,"Name":"PGRANT_NBR","Type":"GM_GRANT_PARTNER"},{"Decimals":0,"Kind":"C","Length":24,"Name":"MEASURE","Type":"FM_MEASURE"},{"Decimals":0,"Kind":"C","Length":10,"Name":"BUDGET_PD","Type":"FM_BUDGET_PERIOD"},{"Decimals":0,"Kind":"C","Length":10,"Name":"PBUDGET_PD","Type":"FM_PBUDGET_PERIOD"},{"Decimals":0,"Kind":"C","Length":10,"Name":"BDGT_ACCOUNT","Type":""},{"Decimals":0,"Kind":"C","Length":10,"Name":"RE_ACCOUNT","Type":""},{"Decimals":0,"Kind":"C","Length":1,"Name":"FIPEX","Type":""},{"Decimals":0,"Kind":"D","Length":16,"Name":"_DATAAGING","Type":"DATA_TEMPERATURE"},{"Decimals":0,"Kind":"C","Length":30,"Name":"KIDNO","Type":"KIDNO"},{"Decimals":0,"Kind":"N","Length":6,"Name":"PRODPER","Type":"JVA_PROD_MONTH"},{"Decimals":0,"Kind":"C","Length":2,"Name":"QSSKZ","Type":"QSSKZ"},{"Decimals":0,"Kind":"C","Length":13,"Name":"PROPMANO","Type":"REHORECNNRM"},{"Decimals":0,"Kind":"C","Length":10,"Name":"GKONT","Type":"GKONT"},{"Decimals":0,"Kind":"C","Length":1,"Name":"GKART","Type":"GKOAR"},{"Decimals":0,"Kind":"C","Length":10,"Name":"GHKON","Type":"GHKONT"},{"Decimals":0,"Kind":"C","Length":10,"Name":"LOGSYSTEM_SENDER","Type":""},{"Decimals":0,"Kind":"C","Length":4,"Name":"BUKRS_SENDER","Type":""},{"Decimals":0,"Kind":"C","Length":10,"Name":"BELNR_SENDER","Type":""},{"Decimals":0,"Kind":"N","Length":4,"Name":"GJAHR_SENDER","Type":""},{"Decimals":0,"Kind":"N","Length":3,"Name":"BUZEI_SENDER","Type":""},{"Decimals":0,"Kind":"C","Length":5,"Name":"HBKID","Type":""},{"Decimals":0,"Kind":"C","Length":5,"Name":"HKTID","Type":""},{"Decimals":0,"Kind":"C","Length":10,"Name":"EBELN","Type":""},{"Decimals":0,"Kind":"N","Length":5,"Name":"EBELP","Type":""},{"Decimals":0,"Kind":"C","Length":1,"Name":"/1DH/OPERATION","Type":"CHAR1"}],"Header":{},"Kind":"Table"},"message.attributes.encoding":"csv","message.batchIndex":0,"message.lastBatch":false,"metadata":[]}

 

 

 
jimgiffin
Product and Topic Expert
Wagner,

Are you testing with Initial Load?  In that scenario - the final batch does not contain the metadata block so the logic fails on the final "payload"

Try to add this check to your logic before you attempt to parse the metadata:
if 'message.lastBatch' in header and header['message.lastBatch']:


baldiway
Explorer
Ok. Thank you for explanation.


rajeshps
Participant
daniel.ingenhaag

G'day! Requesting you to please check on the question below and revert:

https://answers.sap.com/questions/13795123/call-abap-proxy-from-sap-di.html

Thank you in advance! Appreciate your valuable inputs on the above question.
satishkumar1
Explorer
daniel.ingenhaag ,

Thanks for sharing the content here. It's very helpful to me. I have a similar requirement.

I need to place the file in an SFTP folder with the name TEST_2023_04_17.csv. I tried the placeholders in the Write File operator, but it produces TEST_20230417.csv.

Could you please tell me how to achieve this requirement?

Thank you in advance
tgorhan
Explorer

Hello,

When I run the graph, the metrics show that data is processed by the "Read Data From SAP System" operator, and in the process log the next operator, e.g. "Structured File Producer", reports that it is running.

But when we browse inside the directory to the file path we defined, we do not see the new file.

The same happens if we write directly into a cloud table: it reports that it is running, but you do not see a change.

When we stop the graph manually, the file is created / the data is written. Why does the change only appear when we stop the graph, and how can we change that?

Thanks in advance

jimgiffin
Product and Topic Expert
tgorhan I have seen this behavior before and it usually means one or more missing SAP Notes in the source / SLT instance. Can you run the Note Analyzer (SAP Note 3016862) for your scenario(s) and see if any are missing?

tgorhan
Explorer
thanks a lot!
tgorhan
Explorer
thanks now it works better!

Is there a possibility to have the delta loaded as a kind of batch job at night?

So that we register it for the CDS view once and after that run delta jobs at night?

And if yes, how does it have to be configured? Because right now it is a real-time replication only, right?

 

Best regards,

Tobias
pulk1307
Explorer
tgorhan - Were you able to get the configuration for running the graphs in batch mode? Is this something you can share?
tgorhan
Explorer

no, this is not possible.

We changed our approach to:

  1. use RMS Flows (replication engine)
  2. create a graph to start and stop the flows via API, see this blog:
    https://blogs.sap.com/2022/11/11/scheduling-rms-flows-in-data-intelligence-via-the-internal-api/
  3. schedule the graphs for starting and stopping the RMS Flows (e.g. so that they run for 3 hours at night)
rajeshps
Participant
Yes, it is possible to run the graph in batch mode: you can set a cron expression via the scheduler, and to handle volumes you can add a simple batcher custom operator that stops the graph once max.poll.records or max.duration is reached. With the power of message batching you can set sensible values and perform regression tests, so starting and stopping is possible without impacting the Kubernetes worker nodes, pod space or resource/space utilization.
pulk1307
Explorer
If I understand correctly, you have scheduled a graph for controlling the running of the RMS flow. Since you have created a schedule, a com.sap.dh.schedular instance will be running continuously and the SAP DI instance will not be able to hibernate. Is your instance running 24x7? How do you control the system consumption cost?
tgorhan
Explorer
But if the graph stops in a generation 2 graph, you will lose your CDS delta.
tgorhan
Explorer
It was part of a PoC, I do not have an answer for that, but I think there was a question posted about that in the last few days.