A Relational Database Management System Approach for Data Integration in Manufacturing Process

Corallo A.;Esposito M.;Massafra A.;Totaro S.
2018-01-01

Abstract

In this paper we introduce a data integration system implemented as a function within PostgreSQL. The aim of this work is to collect files from two different data sources, a Physical Testing Software (PTS) platform and a Physical Simulation Software (PSS) platform, in order to retrieve specific records through a query and integrate them. Both platforms contain a large number of files in semi-structured or unstructured format. This approach makes it possible to analyse data from different sources and to create a database without ever leaving the PostgreSQL context. The code is modular and can be customized for the specific scope. The approach reduces the information exchanged with the client and the computational cost, because the instructions are applied all at once (on the fly) to the files stored in the Network File System (NFS). Furthermore, the integration outputs and the related data can be stored within it. One of the objectives of this work is to perform a new product introduction (NPI) for multi-source data retrieval and integration. The result is a modular, customizable approach suitable for most integration and retrieval issues. The proposed architecture allows all authorized users to access the data in parallel, independently and on the same device, by running queries directly on the files through the Structured Query Language (SQL).
2018
978-1-5386-1469-3

Use this identifier to cite or link to this document: https://hdl.handle.net/11587/431860