INETUM - Inexus - Self Organizing Data
Log File Analysis and Lineage Extraction
Uploading Log Files
- The process begins with selecting various log files from different systems, including Oracle, middleware, front end, SAS, PGA, dashboard, DB2, and Kubal JCL.
- The first log file uploaded is the front-end log associated with an HTML5 JavaScript application.
Extracting Lineage
- The AI models Gro Llama Code and GPT-4 are tasked to analyze the uploaded log file to extract data lineage independently.
- Another AI model (GPT-4 or CL Opus) will verify the extracted lineage for integrity and accuracy before representing it as a graph.
Enriching Lineage Data
- The verification process includes identifying errors and duplicates in the lineage data while merging any duplicates found.
- By utilizing multiple AIs for independent checks, the risk of hallucination in outputs is minimized.
Growth of Lineage Graph
- As more log files are added to the analysis, connections within the data increase leading to a more extensive lineage graph.
- An exported lineage table summarizes user inputs and transformations applied at each step of processing.
JSON File Download
- Users can download a JSON file containing detailed lineage information such as source elements, transformation rules, targets, and human-readable descriptions.
- This JSON data can be converted into SQL statements for further use in database queries or applications.