Python code for the estimation of missing prices in real-estate market with a dataset of house prices from Teruel city

This research data file contains the necessary software and the dataset for estimating the missing prices of house units. This approach combines several machine learning techniques (linear regression, support vector regression, the k-nearest neighbors and a multi-layer perceptron neural network) wit...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
1. Verfasser: García-Magariño, Iván
Format: Dataset
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator García-Magariño, Iván
description This research data file contains the necessary software and the dataset for estimating the missing prices of house units. This approach combines several machine learning techniques (linear regression, support vector regression, the k-nearest neighbors and a multi-layer perceptron neural network) with several dimensionality reduction techniques (non-negative factorization, recursive feature elimination and feature selection with a variance threshold). It includes the input dataset formed with the available house prices in two neighborhoods of Teruel city (Spain) in November 13, 2017 from Idealista website. These two neighborhoods are the center of the city and “Ensanche”. This dataset supports the research of the authors in the improvement of the setup of agent-based simulations about real-estate market. The work about this dataset has been submitted for consideration for publication to a scientific journal. The open source python code is composed of all the files with the “.py” extension. The main program can be executed from the “main.py” file. The “boxplotErrors.eps” is a chart generated from the execution of the code, and compares the results of the different combinations of machine learning techniques and dimensionality reduction methods. The dataset is in the “data” folder. The input raw data of the house prices are in the “dataRaw.csv” file. These were shuffled into the “dataShuffled.csv” file. We used cross-validation to obtain the estimations of house prices. The outputted estimations alongside the real values are stored in different files of the “data” folder, in which each filename is composed by the machine learning technique abbreviation and the dimensionality reduction method abbreviation.
doi_str_mv 10.17632/mxpgf54czz.2
format Dataset
fullrecord <record><control><sourceid>datacite_PQ8</sourceid><recordid>TN_cdi_datacite_primary_10_17632_mxpgf54czz_2</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>10_17632_mxpgf54czz_2</sourcerecordid><originalsourceid>FETCH-datacite_primary_10_17632_mxpgf54czz_23</originalsourceid><addsrcrecordid>eNqVjjsOwjAQRN1QIKCk3wskJOF3AASipEhvWc46tojjyN4AyekxCERNtZrVe6NhbJlnab7frYuVfXS12m7kOKbFlN0uA2nXgnQVgnIeSCNgIGMFmfh3CqwJwbQ1dN5IDGBa8CiaJEKCEKzwVyS4G9IgoBIkQoxR064P-JWUdxZK9D02IA0NczZRogm4-NwZS07H8nBOXgURQB7FWD3wPOPv3fy3mxfrf_kn_xRVqg</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>dataset</recordtype></control><display><type>dataset</type><title>Python code for the estimation of missing prices in real-estate market with a dataset of house prices from Teruel city</title><source>DataCite</source><creator>García-Magariño, Iván</creator><creatorcontrib>García-Magariño, Iván</creatorcontrib><description>This research data file contains the necessary software and the dataset for estimating the missing prices of house units. This approach combines several machine learning techniques (linear regression, support vector regression, the k-nearest neighbors and a multi-layer perceptron neural network) with several dimensionality reduction techniques (non-negative factorization, recursive feature elimination and feature selection with a variance threshold). It includes the input dataset formed with the available house prices in two neighborhoods of Teruel city (Spain) in November 13, 2017 from Idealista website. These two neighborhoods are the center of the city and “Ensanche”. This dataset supports the research of the authors in the improvement of the setup of agent-based simulations about real-estate market. The work about this dataset has been submitted for consideration for publication to a scientific journal. The open source python code is composed of all the files with the “.py” extension. The main program can be executed from the “main.py” file. The “boxplotErrors.eps” is a chart generated from the execution of the code, and compares the results of the different combinations of machine learning techniques and dimensionality reduction methods. The dataset is in the “data” folder. The input raw data of the house prices are in the “dataRaw.csv” file. These were shuffled into the “dataShuffled.csv” file. We used cross-validation to obtain the estimations of house prices. The outputted estimations alongside the real values are stored in different files of the “data” folder, in which each filename is composed by the machine learning technique abbreviation and the dimensionality reduction method abbreviation.</description><identifier>DOI: 10.17632/mxpgf54czz.2</identifier><language>eng</language><publisher>Mendeley</publisher><subject>Agent-Based Modeling ; Big Data ; Dimensionality Reduction ; Housing Market ; Machine Learning ; Multi-Agent Systems ; Software ; Software Agent</subject><creationdate>2017</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>780,1892</link.rule.ids><linktorsrc>$$Uhttps://commons.datacite.org/doi.org/10.17632/mxpgf54czz.2$$EView_record_in_DataCite.org$$FView_record_in_$$GDataCite.org$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>García-Magariño, Iván</creatorcontrib><title>Python code for the estimation of missing prices in real-estate market with a dataset of house prices from Teruel city</title><description>This research data file contains the necessary software and the dataset for estimating the missing prices of house units. This approach combines several machine learning techniques (linear regression, support vector regression, the k-nearest neighbors and a multi-layer perceptron neural network) with several dimensionality reduction techniques (non-negative factorization, recursive feature elimination and feature selection with a variance threshold). It includes the input dataset formed with the available house prices in two neighborhoods of Teruel city (Spain) in November 13, 2017 from Idealista website. These two neighborhoods are the center of the city and “Ensanche”. This dataset supports the research of the authors in the improvement of the setup of agent-based simulations about real-estate market. The work about this dataset has been submitted for consideration for publication to a scientific journal. The open source python code is composed of all the files with the “.py” extension. The main program can be executed from the “main.py” file. The “boxplotErrors.eps” is a chart generated from the execution of the code, and compares the results of the different combinations of machine learning techniques and dimensionality reduction methods. The dataset is in the “data” folder. The input raw data of the house prices are in the “dataRaw.csv” file. These were shuffled into the “dataShuffled.csv” file. We used cross-validation to obtain the estimations of house prices. The outputted estimations alongside the real values are stored in different files of the “data” folder, in which each filename is composed by the machine learning technique abbreviation and the dimensionality reduction method abbreviation.</description><subject>Agent-Based Modeling</subject><subject>Big Data</subject><subject>Dimensionality Reduction</subject><subject>Housing Market</subject><subject>Machine Learning</subject><subject>Multi-Agent Systems</subject><subject>Software</subject><subject>Software Agent</subject><fulltext>true</fulltext><rsrctype>dataset</rsrctype><creationdate>2017</creationdate><recordtype>dataset</recordtype><sourceid>PQ8</sourceid><recordid>eNqVjjsOwjAQRN1QIKCk3wskJOF3AASipEhvWc46tojjyN4AyekxCERNtZrVe6NhbJlnab7frYuVfXS12m7kOKbFlN0uA2nXgnQVgnIeSCNgIGMFmfh3CqwJwbQ1dN5IDGBa8CiaJEKCEKzwVyS4G9IgoBIkQoxR064P-JWUdxZK9D02IA0NczZRogm4-NwZS07H8nBOXgURQB7FWD3wPOPv3fy3mxfrf_kn_xRVqg</recordid><startdate>20171212</startdate><enddate>20171212</enddate><creator>García-Magariño, Iván</creator><general>Mendeley</general><scope>DYCCY</scope><scope>PQ8</scope></search><sort><creationdate>20171212</creationdate><title>Python code for the estimation of missing prices in real-estate market with a dataset of house prices from Teruel city</title><author>García-Magariño, Iván</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-datacite_primary_10_17632_mxpgf54czz_23</frbrgroupid><rsrctype>datasets</rsrctype><prefilter>datasets</prefilter><language>eng</language><creationdate>2017</creationdate><topic>Agent-Based Modeling</topic><topic>Big Data</topic><topic>Dimensionality Reduction</topic><topic>Housing Market</topic><topic>Machine Learning</topic><topic>Multi-Agent Systems</topic><topic>Software</topic><topic>Software Agent</topic><toplevel>online_resources</toplevel><creatorcontrib>García-Magariño, Iván</creatorcontrib><collection>DataCite (Open Access)</collection><collection>DataCite</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>García-Magariño, Iván</au><format>book</format><genre>unknown</genre><ristype>DATA</ristype><title>Python code for the estimation of missing prices in real-estate market with a dataset of house prices from Teruel city</title><date>2017-12-12</date><risdate>2017</risdate><abstract>This research data file contains the necessary software and the dataset for estimating the missing prices of house units. This approach combines several machine learning techniques (linear regression, support vector regression, the k-nearest neighbors and a multi-layer perceptron neural network) with several dimensionality reduction techniques (non-negative factorization, recursive feature elimination and feature selection with a variance threshold). It includes the input dataset formed with the available house prices in two neighborhoods of Teruel city (Spain) in November 13, 2017 from Idealista website. These two neighborhoods are the center of the city and “Ensanche”. This dataset supports the research of the authors in the improvement of the setup of agent-based simulations about real-estate market. The work about this dataset has been submitted for consideration for publication to a scientific journal. The open source python code is composed of all the files with the “.py” extension. The main program can be executed from the “main.py” file. The “boxplotErrors.eps” is a chart generated from the execution of the code, and compares the results of the different combinations of machine learning techniques and dimensionality reduction methods. The dataset is in the “data” folder. The input raw data of the house prices are in the “dataRaw.csv” file. These were shuffled into the “dataShuffled.csv” file. We used cross-validation to obtain the estimations of house prices. The outputted estimations alongside the real values are stored in different files of the “data” folder, in which each filename is composed by the machine learning technique abbreviation and the dimensionality reduction method abbreviation.</abstract><pub>Mendeley</pub><doi>10.17632/mxpgf54czz.2</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.17632/mxpgf54czz.2
ispartof
issn
language eng
recordid cdi_datacite_primary_10_17632_mxpgf54czz_2
source DataCite
subjects Agent-Based Modeling
Big Data
Dimensionality Reduction
Housing Market
Machine Learning
Multi-Agent Systems
Software
Software Agent
title Python code for the estimation of missing prices in real-estate market with a dataset of house prices from Teruel city
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-09T19%3A55%3A41IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-datacite_PQ8&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=unknown&rft.au=Garc%C3%ADa-Magari%C3%B1o,%20Iv%C3%A1n&rft.date=2017-12-12&rft_id=info:doi/10.17632/mxpgf54czz.2&rft_dat=%3Cdatacite_PQ8%3E10_17632_mxpgf54czz_2%3C/datacite_PQ8%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true