OmniLytics: A Blockchain-based Secure Data Market for Decentralized Machine Learning
We propose OmniLytics, a blockchain-based secure data trading marketplace for machine learning applications. Utilizing OmniLytics, many distributed data owners can contribute their private data to collectively train an ML model requested by some model owners, and receive compensation for data contri...
Gespeichert in:
Veröffentlicht in: | arXiv.org 2021-11 |
---|---|
Hauptverfasser: | , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | arXiv.org |
container_volume | |
creator | Liang, Jiacheng Li, Songze Cao, Bochuan Jiang, Wensi He, Chaoyang |
description | We propose OmniLytics, a blockchain-based secure data trading marketplace for machine learning applications. Utilizing OmniLytics, many distributed data owners can contribute their private data to collectively train an ML model requested by some model owners, and receive compensation for data contribution. OmniLytics enables such model training while simultaneously providing 1) model security against curious data owners; 2) data security against the curious model and data owners; 3) resilience to malicious data owners who provide faulty results to poison model training; and 4) resilience to malicious model owners who intend to evade payment. OmniLytics is implemented as a blockchain smart contract to guarantee the atomicity of payment. In OmniLytics, a model owner splits its model into the private and public parts and publishes the public part on the contract. Through the execution of the contract, the participating data owners securely aggregate their locally trained models to update the model owner's public model and receive reimbursement through the contract. We implement a working prototype of OmniLytics on Ethereum blockchain and perform extensive experiments to measure its gas cost, execution time, and model quality under various parameter combinations. For training a CNN on the MNIST dataset, the MO is able to boost its model accuracy from 62% to 83% within 500ms in blockchain processing time.This demonstrates the effectiveness of OmniLytics for practical deployment. |
format | Article |
fullrecord | <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2550960386</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2550960386</sourcerecordid><originalsourceid>FETCH-proquest_journals_25509603863</originalsourceid><addsrcrecordid>eNqNjrEOgjAUABsTE4nyD02cSWoriG4qGgeIg-zkWR9SwKJtGfTrZfADnG64G25EPC7EIoiXnE-Ib23NGOPRioeh8Eh-fmiVvp2SdkO3dNd2spEVKB1cweKNXlD2BmkCDmgGpkFHy87QBCVqZ6BVnyHKQFZKI00RjFb6PiPjElqL_o9TMj8e8v0peJru1aN1Rd31Rg-qGC7YOmIijsR_1Rf-Ez-g</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2550960386</pqid></control><display><type>article</type><title>OmniLytics: A Blockchain-based Secure Data Market for Decentralized Machine Learning</title><source>Freely Accessible Journals</source><creator>Liang, Jiacheng ; Li, Songze ; Cao, Bochuan ; Jiang, Wensi ; He, Chaoyang</creator><creatorcontrib>Liang, Jiacheng ; Li, Songze ; Cao, Bochuan ; Jiang, Wensi ; He, Chaoyang</creatorcontrib><description>We propose OmniLytics, a blockchain-based secure data trading marketplace for machine learning applications. Utilizing OmniLytics, many distributed data owners can contribute their private data to collectively train an ML model requested by some model owners, and receive compensation for data contribution. OmniLytics enables such model training while simultaneously providing 1) model security against curious data owners; 2) data security against the curious model and data owners; 3) resilience to malicious data owners who provide faulty results to poison model training; and 4) resilience to malicious model owners who intend to evade payment. OmniLytics is implemented as a blockchain smart contract to guarantee the atomicity of payment. In OmniLytics, a model owner splits its model into the private and public parts and publishes the public part on the contract. Through the execution of the contract, the participating data owners securely aggregate their locally trained models to update the model owner's public model and receive reimbursement through the contract. We implement a working prototype of OmniLytics on Ethereum blockchain and perform extensive experiments to measure its gas cost, execution time, and model quality under various parameter combinations. For training a CNN on the MNIST dataset, the MO is able to boost its model accuracy from 62% to 83% within 500ms in blockchain processing time.This demonstrates the effectiveness of OmniLytics for practical deployment.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Blockchain ; Cryptography ; Machine learning ; Resilience ; Security ; Training</subject><ispartof>arXiv.org, 2021-11</ispartof><rights>2021. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>781,785</link.rule.ids></links><search><creatorcontrib>Liang, Jiacheng</creatorcontrib><creatorcontrib>Li, Songze</creatorcontrib><creatorcontrib>Cao, Bochuan</creatorcontrib><creatorcontrib>Jiang, Wensi</creatorcontrib><creatorcontrib>He, Chaoyang</creatorcontrib><title>OmniLytics: A Blockchain-based Secure Data Market for Decentralized Machine Learning</title><title>arXiv.org</title><description>We propose OmniLytics, a blockchain-based secure data trading marketplace for machine learning applications. Utilizing OmniLytics, many distributed data owners can contribute their private data to collectively train an ML model requested by some model owners, and receive compensation for data contribution. OmniLytics enables such model training while simultaneously providing 1) model security against curious data owners; 2) data security against the curious model and data owners; 3) resilience to malicious data owners who provide faulty results to poison model training; and 4) resilience to malicious model owners who intend to evade payment. OmniLytics is implemented as a blockchain smart contract to guarantee the atomicity of payment. In OmniLytics, a model owner splits its model into the private and public parts and publishes the public part on the contract. Through the execution of the contract, the participating data owners securely aggregate their locally trained models to update the model owner's public model and receive reimbursement through the contract. We implement a working prototype of OmniLytics on Ethereum blockchain and perform extensive experiments to measure its gas cost, execution time, and model quality under various parameter combinations. For training a CNN on the MNIST dataset, the MO is able to boost its model accuracy from 62% to 83% within 500ms in blockchain processing time.This demonstrates the effectiveness of OmniLytics for practical deployment.</description><subject>Blockchain</subject><subject>Cryptography</subject><subject>Machine learning</subject><subject>Resilience</subject><subject>Security</subject><subject>Training</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><recordid>eNqNjrEOgjAUABsTE4nyD02cSWoriG4qGgeIg-zkWR9SwKJtGfTrZfADnG64G25EPC7EIoiXnE-Ib23NGOPRioeh8Eh-fmiVvp2SdkO3dNd2spEVKB1cweKNXlD2BmkCDmgGpkFHy87QBCVqZ6BVnyHKQFZKI00RjFb6PiPjElqL_o9TMj8e8v0peJru1aN1Rd31Rg-qGC7YOmIijsR_1Rf-Ez-g</recordid><startdate>20211115</startdate><enddate>20211115</enddate><creator>Liang, Jiacheng</creator><creator>Li, Songze</creator><creator>Cao, Bochuan</creator><creator>Jiang, Wensi</creator><creator>He, Chaoyang</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20211115</creationdate><title>OmniLytics: A Blockchain-based Secure Data Market for Decentralized Machine Learning</title><author>Liang, Jiacheng ; Li, Songze ; Cao, Bochuan ; Jiang, Wensi ; He, Chaoyang</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_25509603863</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Blockchain</topic><topic>Cryptography</topic><topic>Machine learning</topic><topic>Resilience</topic><topic>Security</topic><topic>Training</topic><toplevel>online_resources</toplevel><creatorcontrib>Liang, Jiacheng</creatorcontrib><creatorcontrib>Li, Songze</creatorcontrib><creatorcontrib>Cao, Bochuan</creatorcontrib><creatorcontrib>Jiang, Wensi</creatorcontrib><creatorcontrib>He, Chaoyang</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Access via ProQuest (Open Access)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Liang, Jiacheng</au><au>Li, Songze</au><au>Cao, Bochuan</au><au>Jiang, Wensi</au><au>He, Chaoyang</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>OmniLytics: A Blockchain-based Secure Data Market for Decentralized Machine Learning</atitle><jtitle>arXiv.org</jtitle><date>2021-11-15</date><risdate>2021</risdate><eissn>2331-8422</eissn><abstract>We propose OmniLytics, a blockchain-based secure data trading marketplace for machine learning applications. Utilizing OmniLytics, many distributed data owners can contribute their private data to collectively train an ML model requested by some model owners, and receive compensation for data contribution. OmniLytics enables such model training while simultaneously providing 1) model security against curious data owners; 2) data security against the curious model and data owners; 3) resilience to malicious data owners who provide faulty results to poison model training; and 4) resilience to malicious model owners who intend to evade payment. OmniLytics is implemented as a blockchain smart contract to guarantee the atomicity of payment. In OmniLytics, a model owner splits its model into the private and public parts and publishes the public part on the contract. Through the execution of the contract, the participating data owners securely aggregate their locally trained models to update the model owner's public model and receive reimbursement through the contract. We implement a working prototype of OmniLytics on Ethereum blockchain and perform extensive experiments to measure its gas cost, execution time, and model quality under various parameter combinations. For training a CNN on the MNIST dataset, the MO is able to boost its model accuracy from 62% to 83% within 500ms in blockchain processing time.This demonstrates the effectiveness of OmniLytics for practical deployment.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | EISSN: 2331-8422 |
ispartof | arXiv.org, 2021-11 |
issn | 2331-8422 |
language | eng |
recordid | cdi_proquest_journals_2550960386 |
source | Freely Accessible Journals |
subjects | Blockchain Cryptography Machine learning Resilience Security Training |
title | OmniLytics: A Blockchain-based Secure Data Market for Decentralized Machine Learning |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-16T22%3A05%3A09IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=OmniLytics:%20A%20Blockchain-based%20Secure%20Data%20Market%20for%20Decentralized%20Machine%20Learning&rft.jtitle=arXiv.org&rft.au=Liang,%20Jiacheng&rft.date=2021-11-15&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2550960386%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2550960386&rft_id=info:pmid/&rfr_iscdi=true |