Application of latent Dirichlet allocation and autoencoder to real estate datasets

At present, there are too many types and numbers of real estate features in the real estate market, and it is difficult to effectively recommend real estate to customers with more complicated needs. The datasets of real estate often encompass a wide array of features, including categories, numerical...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:The Journal of supercomputing 2025, Vol.81 (1), Article 118
Hauptverfasser: Gu, Runhe, Lin, Luchun
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:At present, there are too many types and numbers of real estate features in the real estate market, and it is difficult to effectively recommend real estate to customers with more complicated needs. The datasets of real estate often encompass a wide array of features, including categories, numerical values, and textual descriptions, which complicates the process of delivering precise and satisfactory recommendations. To address this issue, a cluster-based hybrid method combining latent Dirichlet allocation and autoencoder is proposed in this paper. It can effectively improve the early input of clustering by the two-stage feature extraction of the data through the topic model and the autoencoder. Experimental findings demonstrate that our hybrid method surpasses standalone LDA and autoencoder, as well as other conventional clustering algorithms. The combined results of Silhouette coefficient, DB index, and CH index are significantly better than the results of traditional clustering. Moreover, we conduct a comprehensive analysis of the results, emphasizing the benefits of our approach in managing multi-type features, encompassing categorical, numerical, and string data. It is anticipated that this research will contribute valuable insights into the field of recommendation systems.
ISSN:0920-8542
1573-0484
DOI:10.1007/s11227-024-06659-9