Large-scale literature data resource integration method and system

The invention provides a large-scale literature data resource integration method, which comprises the following steps: acquiring a storage structure of a large-scale literature data resource, performing source data field matching and cleaning rule setting according to the storage structure of the li...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: ZHU YUHU, FENG LI
Format: Patent
Sprache:chi ; eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The invention provides a large-scale literature data resource integration method, which comprises the following steps: acquiring a storage structure of a large-scale literature data resource, performing source data field matching and cleaning rule setting according to the storage structure of the literature data resource and a storage form of target literature data, and performing mapping based on a template matching mode; reading the mapped data into a message queue by using an application message queue; performing data cleaning and data standardization processing operation on the data in the message queue; configuring a data field needing similarity monitoring in a data transmission process, a filter basic parameter and a storage mode of a target data source, taking the processed data out of the message queue, and putting the data into the target data source to realize data loading; and indexing the data from the data pool to complete data resource integration. 本申请提供了一种大规模文献数据资源整合方法,包括:获取大规模文献数据资源的存储结构,根据文献