Development of online travel Web scraping for tourism statistics in Indonesia
Introduction. This research aims to study a novel approach to producing tourism statistics, especially accommodation statistics, in Indonesia using scraping of online travel agent Websites. Method. Accommodation data (e.g., room availability and price) were gathered from two of the largest online tr...
Gespeichert in:
Veröffentlicht in: | Information research 2020, Vol.25 (4) |
---|---|
Hauptverfasser: | , , , , , , |
Format: | Artikel |
Sprache: | eng |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | 4 |
container_start_page | |
container_title | Information research |
container_volume | 25 |
creator | Adhinugroho, Yustiar Putra, Amanda Luqman, Muhammad Ermawan, Geri Takdir, Takdir Mariyah, Siti Pramana, Setia |
description | Introduction. This research aims to study a novel approach to producing tourism statistics, especially accommodation statistics, in Indonesia using scraping of online travel agent Websites. Method. Accommodation data (e.g., room availability and price) were gathered from two of the largest online travel agencies in Indonesia. All data were collected automatically from the sites’ URLs listed in the sitemap. Analysis. The data were collected daily from 6 March to 27 July 2019. Datasets from the two Websites were merged. The room occupation rate (ROR) for each province was calculated and compared with the official statistics from Statistics Indonesia. Results. The results show that the online room occupancy rates and official statistics have a similar pattern indicating the use of the Web scraping technique provides valuable information, to measure the room occupation rate with an advantage in terms of cost and collection time. Conclusions. It is feasible to use big data as a proxy of or a complement to official statistics, especially in tourism statistics. By using the Web scraping technique, the indicator that usually requires significant time and cost can be done in real-time and less cost. This new approach would improve the quality of tourism statistics produced by BPS Statistics Indonesia. |
doi_str_mv | 10.47989/irpaper885 |
format | Article |
fullrecord | <record><control><sourceid>crossref</sourceid><recordid>TN_cdi_crossref_primary_10_47989_irpaper885</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>10_47989_irpaper885</sourcerecordid><originalsourceid>FETCH-LOGICAL-c270t-6ff2c4c0c613b0941dc29bfe1d147424784f3bd9a3ffb7bd54c20459a10b97393</originalsourceid><addsrcrecordid>eNpNkMFLBCEchSUK2rZO_QPeY0pHZ9RjbLUtbHQpOg7qaBgzKv4s6L9vqKBO7_EOj48PoXNKLrlQUl2FknV2RcruAK0o62VDe8oO__VjdALwRkhLuOhW6OHGfbgp5dnFipPHKU4hOlyLXmb84gwGW3QO8RX7VHBN7yXAjKHqGqAGCzhEvItjig6CPkVHXk_gzn5zjZ7vbp82983-cbvbXO8b2wpSm9771nJL7MJjiOJ0tK0y3tGRcsFbLiT3zIxKM--NMGPH7YLbKU2JUYIptkYXP7-2JIDi_JBLmHX5HCgZvk0MfybYF-f5VBc</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Development of online travel Web scraping for tourism statistics in Indonesia</title><source>Alma/SFX Local Collection</source><creator>Adhinugroho, Yustiar ; Putra, Amanda ; Luqman, Muhammad ; Ermawan, Geri ; Takdir, Takdir ; Mariyah, Siti ; Pramana, Setia</creator><creatorcontrib>Adhinugroho, Yustiar ; Putra, Amanda ; Luqman, Muhammad ; Ermawan, Geri ; Takdir, Takdir ; Mariyah, Siti ; Pramana, Setia ; Politeknik Statistika STIS, Jakarta, Indonesia</creatorcontrib><description>Introduction. This research aims to study a novel approach to producing tourism statistics, especially accommodation statistics, in Indonesia using scraping of online travel agent Websites. Method. Accommodation data (e.g., room availability and price) were gathered from two of the largest online travel agencies in Indonesia. All data were collected automatically from the sites’ URLs listed in the sitemap. Analysis. The data were collected daily from 6 March to 27 July 2019. Datasets from the two Websites were merged. The room occupation rate (ROR) for each province was calculated and compared with the official statistics from Statistics Indonesia. Results. The results show that the online room occupancy rates and official statistics have a similar pattern indicating the use of the Web scraping technique provides valuable information, to measure the room occupation rate with an advantage in terms of cost and collection time. Conclusions. It is feasible to use big data as a proxy of or a complement to official statistics, especially in tourism statistics. By using the Web scraping technique, the indicator that usually requires significant time and cost can be done in real-time and less cost. This new approach would improve the quality of tourism statistics produced by BPS Statistics Indonesia.</description><identifier>ISSN: 1368-1613</identifier><identifier>EISSN: 1368-1613</identifier><identifier>DOI: 10.47989/irpaper885</identifier><language>eng</language><ispartof>Information research, 2020, Vol.25 (4)</ispartof><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c270t-6ff2c4c0c613b0941dc29bfe1d147424784f3bd9a3ffb7bd54c20459a10b97393</citedby></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,777,781,4010,27904,27905,27906</link.rule.ids></links><search><creatorcontrib>Adhinugroho, Yustiar</creatorcontrib><creatorcontrib>Putra, Amanda</creatorcontrib><creatorcontrib>Luqman, Muhammad</creatorcontrib><creatorcontrib>Ermawan, Geri</creatorcontrib><creatorcontrib>Takdir, Takdir</creatorcontrib><creatorcontrib>Mariyah, Siti</creatorcontrib><creatorcontrib>Pramana, Setia</creatorcontrib><creatorcontrib>Politeknik Statistika STIS, Jakarta, Indonesia</creatorcontrib><title>Development of online travel Web scraping for tourism statistics in Indonesia</title><title>Information research</title><description>Introduction. This research aims to study a novel approach to producing tourism statistics, especially accommodation statistics, in Indonesia using scraping of online travel agent Websites. Method. Accommodation data (e.g., room availability and price) were gathered from two of the largest online travel agencies in Indonesia. All data were collected automatically from the sites’ URLs listed in the sitemap. Analysis. The data were collected daily from 6 March to 27 July 2019. Datasets from the two Websites were merged. The room occupation rate (ROR) for each province was calculated and compared with the official statistics from Statistics Indonesia. Results. The results show that the online room occupancy rates and official statistics have a similar pattern indicating the use of the Web scraping technique provides valuable information, to measure the room occupation rate with an advantage in terms of cost and collection time. Conclusions. It is feasible to use big data as a proxy of or a complement to official statistics, especially in tourism statistics. By using the Web scraping technique, the indicator that usually requires significant time and cost can be done in real-time and less cost. This new approach would improve the quality of tourism statistics produced by BPS Statistics Indonesia.</description><issn>1368-1613</issn><issn>1368-1613</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><recordid>eNpNkMFLBCEchSUK2rZO_QPeY0pHZ9RjbLUtbHQpOg7qaBgzKv4s6L9vqKBO7_EOj48PoXNKLrlQUl2FknV2RcruAK0o62VDe8oO__VjdALwRkhLuOhW6OHGfbgp5dnFipPHKU4hOlyLXmb84gwGW3QO8RX7VHBN7yXAjKHqGqAGCzhEvItjig6CPkVHXk_gzn5zjZ7vbp82983-cbvbXO8b2wpSm9771nJL7MJjiOJ0tK0y3tGRcsFbLiT3zIxKM--NMGPH7YLbKU2JUYIptkYXP7-2JIDi_JBLmHX5HCgZvk0MfybYF-f5VBc</recordid><startdate>2020</startdate><enddate>2020</enddate><creator>Adhinugroho, Yustiar</creator><creator>Putra, Amanda</creator><creator>Luqman, Muhammad</creator><creator>Ermawan, Geri</creator><creator>Takdir, Takdir</creator><creator>Mariyah, Siti</creator><creator>Pramana, Setia</creator><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>2020</creationdate><title>Development of online travel Web scraping for tourism statistics in Indonesia</title><author>Adhinugroho, Yustiar ; Putra, Amanda ; Luqman, Muhammad ; Ermawan, Geri ; Takdir, Takdir ; Mariyah, Siti ; Pramana, Setia</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c270t-6ff2c4c0c613b0941dc29bfe1d147424784f3bd9a3ffb7bd54c20459a10b97393</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Adhinugroho, Yustiar</creatorcontrib><creatorcontrib>Putra, Amanda</creatorcontrib><creatorcontrib>Luqman, Muhammad</creatorcontrib><creatorcontrib>Ermawan, Geri</creatorcontrib><creatorcontrib>Takdir, Takdir</creatorcontrib><creatorcontrib>Mariyah, Siti</creatorcontrib><creatorcontrib>Pramana, Setia</creatorcontrib><creatorcontrib>Politeknik Statistika STIS, Jakarta, Indonesia</creatorcontrib><collection>CrossRef</collection><jtitle>Information research</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Adhinugroho, Yustiar</au><au>Putra, Amanda</au><au>Luqman, Muhammad</au><au>Ermawan, Geri</au><au>Takdir, Takdir</au><au>Mariyah, Siti</au><au>Pramana, Setia</au><aucorp>Politeknik Statistika STIS, Jakarta, Indonesia</aucorp><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Development of online travel Web scraping for tourism statistics in Indonesia</atitle><jtitle>Information research</jtitle><date>2020</date><risdate>2020</risdate><volume>25</volume><issue>4</issue><issn>1368-1613</issn><eissn>1368-1613</eissn><abstract>Introduction. This research aims to study a novel approach to producing tourism statistics, especially accommodation statistics, in Indonesia using scraping of online travel agent Websites. Method. Accommodation data (e.g., room availability and price) were gathered from two of the largest online travel agencies in Indonesia. All data were collected automatically from the sites’ URLs listed in the sitemap. Analysis. The data were collected daily from 6 March to 27 July 2019. Datasets from the two Websites were merged. The room occupation rate (ROR) for each province was calculated and compared with the official statistics from Statistics Indonesia. Results. The results show that the online room occupancy rates and official statistics have a similar pattern indicating the use of the Web scraping technique provides valuable information, to measure the room occupation rate with an advantage in terms of cost and collection time. Conclusions. It is feasible to use big data as a proxy of or a complement to official statistics, especially in tourism statistics. By using the Web scraping technique, the indicator that usually requires significant time and cost can be done in real-time and less cost. This new approach would improve the quality of tourism statistics produced by BPS Statistics Indonesia.</abstract><doi>10.47989/irpaper885</doi><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1368-1613 |
ispartof | Information research, 2020, Vol.25 (4) |
issn | 1368-1613 1368-1613 |
language | eng |
recordid | cdi_crossref_primary_10_47989_irpaper885 |
source | Alma/SFX Local Collection |
title | Development of online travel Web scraping for tourism statistics in Indonesia |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-19T03%3A08%3A29IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-crossref&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Development%20of%20online%20travel%20Web%20scraping%20for%20tourism%20statistics%20in%20Indonesia&rft.jtitle=Information%20research&rft.au=Adhinugroho,%20Yustiar&rft.aucorp=Politeknik%20Statistika%20STIS,%20Jakarta,%20Indonesia&rft.date=2020&rft.volume=25&rft.issue=4&rft.issn=1368-1613&rft.eissn=1368-1613&rft_id=info:doi/10.47989/irpaper885&rft_dat=%3Ccrossref%3E10_47989_irpaper885%3C/crossref%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |