Patient-Centric Reddit Cancer Dataset
The Patient-Centric Reddit Cancer Dataset (PCRCD) consists of all posts from r/Cancer during the period 01/01/2014 and 04/30/2020. A total of 23,028 unique posts were extracted. These posts were labelled according to three patient demographic traits: patient versus caregiver, sex and age. Each post...
Gespeichert in:
Hauptverfasser: | , , , |
---|---|
Format: | Dataset |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | The Patient-Centric Reddit Cancer Dataset (PCRCD) consists of all posts from r/Cancer during the period 01/01/2014 and 04/30/2020. A total of 23,028 unique posts were extracted. These posts were labelled according to three patient demographic traits: patient versus caregiver, sex and age. Each post was independently labelled by three annotators. PCRCD can help with the understanding of the challenges and concerns being reported by the patients and caregivers dealing with cancer. It also exemplifies the potential and limitations associated with the translation of raw posts from online health forums into a common resource for researchers. |
---|---|
DOI: | 10.5281/zenodo.7308685 |