Fraudulent and Legitimate Online Shops Dataset

The dataset contains fake (fraudulent) e-shops data together with legitimate e-shops data. The dataset is balanced and contains 1140 records of 579 fake (fraudulent) and 561 real (legitimate) online shops. Each record contains the following fields: 1. Online shop’s URL; 2. Label - {legitimate, fraud...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
1. Verfasser: Liutkevicius, Agnius
Format: Dataset
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The dataset contains fake (fraudulent) e-shops data together with legitimate e-shops data. The dataset is balanced and contains 1140 records of 579 fake (fraudulent) and 561 real (legitimate) online shops. Each record contains the following fields: 1. Online shop’s URL; 2. Label - {legitimate, fraudulent}; 3. Domain length - Number of symbols in the host domain name; 4. Top domain length - Number of symbols in the top domain name; 5. Presence of prefix “www” in the active URL of the online shop, values {0 - no, 1 - yes}; 6. Number of digits in the URL; 7. Number of letters in the URL; 8. Number of dots (.) in the URL; 9. Number of hyphens (-) in the URL; 10. Presence of credit card payment, values {0 - no, 1 - yes}; 11. Presence of money back payment, including PayPal, Alipay, Apple Pay, Google Pay, Samsung Pay, and Amazon Pay, values {0 - no, 1 - yes}; 12. Presence of cash on delivery payment, values {0 - no, 1 - yes}; 13. Presence of the ability to use cryptocurrencies for payments, values {0 - no, 1 - yes}; 14. Presence of free contact emails, including Gmail, Hotmail, Outlook, Yahoo Mail, Zoho Mail, ProtonMail, iCloud Mail, GMX Mail, AOL Mail, mail.com, Yandex Mail, Mail2World, or Tutanota, values {0 – email address not found, 1 - free email address, 2 - domain email address, 3 – other email address}; 15. Presence of logo URL, values {0 - no, 1 - yes}; 16. SSL certificate issuer name; 17. SSL certificate expire date; 18. SSL certificate issuer organization name; 19. SSL certificate issuer organization ID, values {1 - Cloudflare, Inc., 2 - Let's Encrypt, 3 - Sectigo Limited, 4 - cPanel, Inc., 5 - GoDaddy.com, Inc., 6 - Amazon, 7 - DigiCert, Inc., 8 - GlobalSign nv-sa, 9 - Google Trust Services LLC, 10 - ZeroSSL, 11 - other organization}; 20. Indication of young domain, registered 400 days ago or later, values {0 - ‘old’ domain name, 1 - ‘young’ domain name, 2 - ‘hidden’}; 21. Domain registration date; 22. Presence of TrustPilot reviews, values {0 - no, 1 - yes}; 23. TrustPilot score, values - real number from 0 to 5 or -1 if no reviews are available; 24. Presence of SiteJabber reviews, values {0 - no, 1 - yes}; 25. Presence in the standard Tranco list, values {0 - no, 1 - yes}; 26. Tranco List rank, values - integer number from 1 to 1000000 or -1 if domain is not listed in the Tranco list.
DOI:10.17632/m7xtkx7g5m