Method, device and apparatus for determining an accumulated independent access volume based on stream batch matching

Bibliographic details
First author: LEI JINWEI
Format: Patent
Language: chi; eng
Description
Summary: The embodiment of the invention discloses a method for determining an accumulated independent access volume based on stream batch matching. The method comprises the following steps: acquiring a user access stream through a stream data source; extracting user access data from the user access stream at a preset time interval to obtain a batch data source; creating a batch task from the batch data source and executing it to perform deduplication, yielding a historical access dimension table that is at least partially deduplicated; and creating a stream task for the current time period and executing it against the current historical access dimension table and the stream data for the current time period in the user access stream to deduplicate again, thereby obtaining the accumulated independent access volume for the current time period. 本说明书实施例公开了一种基于流批配合的累计独立访问量确定方法,方法包括:通过流数据源获取用户访问流;按照预设的时间间隔从用户访问流中提取用户访问数据,得到批数据源;根据批数据源创建批任务,执行批任务
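The steps in the summary can be illustrated with a minimal in-memory sketch. It is not the patented implementation: the record shape `(timestamp, user_id)`, the names `run_batch_task`, `CumulativeUV`, `refresh_history` and `run_stream_task`, and the use of plain Python sets in place of a real stream engine and dimension table are all assumptions made for illustration.

```python
from dataclasses import dataclass, field
from typing import Iterable, Set, Tuple

# Hypothetical record shape: (timestamp in seconds, user id).
Event = Tuple[int, str]


def run_batch_task(events: Iterable[Event], start: int, end: int) -> Set[str]:
    """Deduplicate the users seen in [start, end), i.e. one batch data source."""
    return {user for ts, user in events if start <= ts < end}


@dataclass
class CumulativeUV:
    # Stand-in for the historical access dimension table (deduplicated user ids).
    history: Set[str] = field(default_factory=set)
    # Events before this timestamp are already folded into `history`.
    covered_until: int = 0

    def refresh_history(self, events: Iterable[Event], now: int, interval: int) -> None:
        """Run one batch task per fully elapsed interval and merge its result."""
        events = list(events)
        while self.covered_until + interval <= now:
            end = self.covered_until + interval
            self.history |= run_batch_task(events, self.covered_until, end)
            self.covered_until = end

    def run_stream_task(self, events: Iterable[Event], now: int) -> int:
        """Deduplicate the current period's stream data against the history
        table and return the accumulated independent access volume."""
        current = {user for ts, user in events if self.covered_until <= ts <= now}
        new_users = current - self.history  # second deduplication pass
        return len(self.history) + len(new_users)


if __name__ == "__main__":
    stream = [(0, "a"), (5, "b"), (12, "a"), (18, "c"), (25, "b"), (27, "d")]
    uv = CumulativeUV()
    uv.refresh_history(stream, now=27, interval=10)
    print(uv.run_stream_task(stream, now=27))
```

In this sketch the batch tasks absorb the bulk of the deduplication at each interval boundary, so the stream task only has to compare the current period's users against the already deduplicated history.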