Development of online travel Web scraping for tourism statistics in Indonesia

Introduction. This research aims to study a novel approach to producing tourism statistics, especially accommodation statistics, in Indonesia using scraping of online travel agent Websites. Method. Accommodation data (e.g., room availability and price) were gathered from two of the largest online tr...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Information research 2020, Vol.25 (4)
Hauptverfasser: Adhinugroho, Yustiar, Putra, Amanda, Luqman, Muhammad, Ermawan, Geri, Takdir, Takdir, Mariyah, Siti, Pramana, Setia
Format: Artikel
Sprache:eng
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue 4
container_start_page
container_title Information research
container_volume 25
creator Adhinugroho, Yustiar
Putra, Amanda
Luqman, Muhammad
Ermawan, Geri
Takdir, Takdir
Mariyah, Siti
Pramana, Setia
description Introduction. This research aims to study a novel approach to producing tourism statistics, especially accommodation statistics, in Indonesia using scraping of online travel agent Websites. Method. Accommodation data (e.g., room availability and price) were gathered from two of the largest online travel agencies in Indonesia. All data were collected automatically from the sites’ URLs listed in the sitemap. Analysis. The data were collected daily from 6 March to 27 July 2019. Datasets from the two Websites were merged. The room occupation rate (ROR) for each province was calculated and compared with the official statistics from Statistics Indonesia. Results. The results show that the online room occupancy rates and official statistics have a similar pattern indicating the use of the Web scraping technique provides valuable information, to measure the room occupation rate with an advantage in terms of cost and collection time. Conclusions. It is feasible to use big data as a proxy of or a complement to official statistics, especially in tourism statistics. By using the Web scraping technique, the indicator that usually requires significant time and cost can be done in real-time and less cost. This new approach would improve the quality of tourism statistics produced by BPS Statistics Indonesia.
doi_str_mv 10.47989/irpaper885
format Article
fullrecord <record><control><sourceid>crossref</sourceid><recordid>TN_cdi_crossref_primary_10_47989_irpaper885</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>10_47989_irpaper885</sourcerecordid><originalsourceid>FETCH-LOGICAL-c270t-6ff2c4c0c613b0941dc29bfe1d147424784f3bd9a3ffb7bd54c20459a10b97393</originalsourceid><addsrcrecordid>eNpNkMFLBCEchSUK2rZO_QPeY0pHZ9RjbLUtbHQpOg7qaBgzKv4s6L9vqKBO7_EOj48PoXNKLrlQUl2FknV2RcruAK0o62VDe8oO__VjdALwRkhLuOhW6OHGfbgp5dnFipPHKU4hOlyLXmb84gwGW3QO8RX7VHBN7yXAjKHqGqAGCzhEvItjig6CPkVHXk_gzn5zjZ7vbp82983-cbvbXO8b2wpSm9771nJL7MJjiOJ0tK0y3tGRcsFbLiT3zIxKM--NMGPH7YLbKU2JUYIptkYXP7-2JIDi_JBLmHX5HCgZvk0MfybYF-f5VBc</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Development of online travel Web scraping for tourism statistics in Indonesia</title><source>Alma/SFX Local Collection</source><creator>Adhinugroho, Yustiar ; Putra, Amanda ; Luqman, Muhammad ; Ermawan, Geri ; Takdir, Takdir ; Mariyah, Siti ; Pramana, Setia</creator><creatorcontrib>Adhinugroho, Yustiar ; Putra, Amanda ; Luqman, Muhammad ; Ermawan, Geri ; Takdir, Takdir ; Mariyah, Siti ; Pramana, Setia ; Politeknik Statistika STIS, Jakarta, Indonesia</creatorcontrib><description>Introduction. This research aims to study a novel approach to producing tourism statistics, especially accommodation statistics, in Indonesia using scraping of online travel agent Websites. Method. Accommodation data (e.g., room availability and price) were gathered from two of the largest online travel agencies in Indonesia. All data were collected automatically from the sites’ URLs listed in the sitemap. Analysis. The data were collected daily from 6 March to 27 July 2019. Datasets from the two Websites were merged. The room occupation rate (ROR) for each province was calculated and compared with the official statistics from Statistics Indonesia. Results. The results show that the online room occupancy rates and official statistics have a similar pattern indicating the use of the Web scraping technique provides valuable information, to measure the room occupation rate with an advantage in terms of cost and collection time. Conclusions. It is feasible to use big data as a proxy of or a complement to official statistics, especially in tourism statistics. By using the Web scraping technique, the indicator that usually requires significant time and cost can be done in real-time and less cost. This new approach would improve the quality of tourism statistics produced by BPS Statistics Indonesia.</description><identifier>ISSN: 1368-1613</identifier><identifier>EISSN: 1368-1613</identifier><identifier>DOI: 10.47989/irpaper885</identifier><language>eng</language><ispartof>Information research, 2020, Vol.25 (4)</ispartof><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c270t-6ff2c4c0c613b0941dc29bfe1d147424784f3bd9a3ffb7bd54c20459a10b97393</citedby></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,777,781,4010,27904,27905,27906</link.rule.ids></links><search><creatorcontrib>Adhinugroho, Yustiar</creatorcontrib><creatorcontrib>Putra, Amanda</creatorcontrib><creatorcontrib>Luqman, Muhammad</creatorcontrib><creatorcontrib>Ermawan, Geri</creatorcontrib><creatorcontrib>Takdir, Takdir</creatorcontrib><creatorcontrib>Mariyah, Siti</creatorcontrib><creatorcontrib>Pramana, Setia</creatorcontrib><creatorcontrib>Politeknik Statistika STIS, Jakarta, Indonesia</creatorcontrib><title>Development of online travel Web scraping for tourism statistics in Indonesia</title><title>Information research</title><description>Introduction. This research aims to study a novel approach to producing tourism statistics, especially accommodation statistics, in Indonesia using scraping of online travel agent Websites. Method. Accommodation data (e.g., room availability and price) were gathered from two of the largest online travel agencies in Indonesia. All data were collected automatically from the sites’ URLs listed in the sitemap. Analysis. The data were collected daily from 6 March to 27 July 2019. Datasets from the two Websites were merged. The room occupation rate (ROR) for each province was calculated and compared with the official statistics from Statistics Indonesia. Results. The results show that the online room occupancy rates and official statistics have a similar pattern indicating the use of the Web scraping technique provides valuable information, to measure the room occupation rate with an advantage in terms of cost and collection time. Conclusions. It is feasible to use big data as a proxy of or a complement to official statistics, especially in tourism statistics. By using the Web scraping technique, the indicator that usually requires significant time and cost can be done in real-time and less cost. This new approach would improve the quality of tourism statistics produced by BPS Statistics Indonesia.</description><issn>1368-1613</issn><issn>1368-1613</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><recordid>eNpNkMFLBCEchSUK2rZO_QPeY0pHZ9RjbLUtbHQpOg7qaBgzKv4s6L9vqKBO7_EOj48PoXNKLrlQUl2FknV2RcruAK0o62VDe8oO__VjdALwRkhLuOhW6OHGfbgp5dnFipPHKU4hOlyLXmb84gwGW3QO8RX7VHBN7yXAjKHqGqAGCzhEvItjig6CPkVHXk_gzn5zjZ7vbp82983-cbvbXO8b2wpSm9771nJL7MJjiOJ0tK0y3tGRcsFbLiT3zIxKM--NMGPH7YLbKU2JUYIptkYXP7-2JIDi_JBLmHX5HCgZvk0MfybYF-f5VBc</recordid><startdate>2020</startdate><enddate>2020</enddate><creator>Adhinugroho, Yustiar</creator><creator>Putra, Amanda</creator><creator>Luqman, Muhammad</creator><creator>Ermawan, Geri</creator><creator>Takdir, Takdir</creator><creator>Mariyah, Siti</creator><creator>Pramana, Setia</creator><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>2020</creationdate><title>Development of online travel Web scraping for tourism statistics in Indonesia</title><author>Adhinugroho, Yustiar ; Putra, Amanda ; Luqman, Muhammad ; Ermawan, Geri ; Takdir, Takdir ; Mariyah, Siti ; Pramana, Setia</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c270t-6ff2c4c0c613b0941dc29bfe1d147424784f3bd9a3ffb7bd54c20459a10b97393</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Adhinugroho, Yustiar</creatorcontrib><creatorcontrib>Putra, Amanda</creatorcontrib><creatorcontrib>Luqman, Muhammad</creatorcontrib><creatorcontrib>Ermawan, Geri</creatorcontrib><creatorcontrib>Takdir, Takdir</creatorcontrib><creatorcontrib>Mariyah, Siti</creatorcontrib><creatorcontrib>Pramana, Setia</creatorcontrib><creatorcontrib>Politeknik Statistika STIS, Jakarta, Indonesia</creatorcontrib><collection>CrossRef</collection><jtitle>Information research</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Adhinugroho, Yustiar</au><au>Putra, Amanda</au><au>Luqman, Muhammad</au><au>Ermawan, Geri</au><au>Takdir, Takdir</au><au>Mariyah, Siti</au><au>Pramana, Setia</au><aucorp>Politeknik Statistika STIS, Jakarta, Indonesia</aucorp><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Development of online travel Web scraping for tourism statistics in Indonesia</atitle><jtitle>Information research</jtitle><date>2020</date><risdate>2020</risdate><volume>25</volume><issue>4</issue><issn>1368-1613</issn><eissn>1368-1613</eissn><abstract>Introduction. This research aims to study a novel approach to producing tourism statistics, especially accommodation statistics, in Indonesia using scraping of online travel agent Websites. Method. Accommodation data (e.g., room availability and price) were gathered from two of the largest online travel agencies in Indonesia. All data were collected automatically from the sites’ URLs listed in the sitemap. Analysis. The data were collected daily from 6 March to 27 July 2019. Datasets from the two Websites were merged. The room occupation rate (ROR) for each province was calculated and compared with the official statistics from Statistics Indonesia. Results. The results show that the online room occupancy rates and official statistics have a similar pattern indicating the use of the Web scraping technique provides valuable information, to measure the room occupation rate with an advantage in terms of cost and collection time. Conclusions. It is feasible to use big data as a proxy of or a complement to official statistics, especially in tourism statistics. By using the Web scraping technique, the indicator that usually requires significant time and cost can be done in real-time and less cost. This new approach would improve the quality of tourism statistics produced by BPS Statistics Indonesia.</abstract><doi>10.47989/irpaper885</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1368-1613
ispartof Information research, 2020, Vol.25 (4)
issn 1368-1613
1368-1613
language eng
recordid cdi_crossref_primary_10_47989_irpaper885
source Alma/SFX Local Collection
title Development of online travel Web scraping for tourism statistics in Indonesia
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-19T03%3A08%3A29IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-crossref&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Development%20of%20online%20travel%20Web%20scraping%20for%20tourism%20statistics%20in%20Indonesia&rft.jtitle=Information%20research&rft.au=Adhinugroho,%20Yustiar&rft.aucorp=Politeknik%20Statistika%20STIS,%20Jakarta,%20Indonesia&rft.date=2020&rft.volume=25&rft.issue=4&rft.issn=1368-1613&rft.eissn=1368-1613&rft_id=info:doi/10.47989/irpaper885&rft_dat=%3Ccrossref%3E10_47989_irpaper885%3C/crossref%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true