SimHash-based binary code similarity comparison method

The invention relates to a binary code similarity comparison method based on SimHash, and belongs to the field of code comparison. According to the method, the binary codes are disassembled, the assembly codes are preprocessed, the assembly codes are subjected to standardization processing, the SimH...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: YASUTSUNE, ZHANG JIANWEI, FU XIUFENG, JIA ZHANGTAO, FENG DACHENG, SHAO SA, KONG XIANGBING, LIU YUBO, TAO JINLONG, JIN YUCHUAN
Format: Patent
Sprache:chi ; eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator YASUTSUNE
ZHANG JIANWEI
FU XIUFENG
JIA ZHANGTAO
FENG DACHENG
SHAO SA
KONG XIANGBING
LIU YUBO
TAO JINLONG
JIN YUCHUAN
description The invention relates to a binary code similarity comparison method based on SimHash, and belongs to the field of code comparison. According to the method, the binary codes are disassembled, the assembly codes are preprocessed, the assembly codes are subjected to standardization processing, the SimHash values of the assembly codes are calculated, a code feature relation library framework is constructed, and the binary codes are rapidly positioned based on text similarity. The binary code similarity comparison method has the following advantages: the scheme provided by the invention can ensure the efficiency of binary code similarity comparison while giving consideration to the comparison efficiency; according to the method, the text comparison method based on SimHash is adopted, and the binary code similarity comparison efficiency can be improved. 本发明涉及一种基于SimHash的二进制代码相似性比对方法,属于代码比对领域。本发明对二进制代码反汇编及汇编代码预处理,对汇编代码标准化处理,计算汇编代码SimHash值,构建代码特征关系库构架,基于文本相似性的二进制代码快速定位。本发明具有以下优点:本发明提出的方案,能够在兼顾对比效率的同时,保证二进制代码相似性比对的效率;
format Patent
fullrecord <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_CN114995880A</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>CN114995880A</sourcerecordid><originalsourceid>FETCH-epo_espacenet_CN114995880A3</originalsourceid><addsrcrecordid>eNrjZDALzsz1SCzO0E1KLE5NUUjKzEssqlRIzk9JVSjOzM3MSSzKLAHxcwuArOL8PIXc1JKM_BQeBta0xJziVF4ozc2g6OYa4uyhm1qQH59aXJCYnJqXWhLv7GdoaGJpaWphYeBoTIwaAKp1LdM</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>SimHash-based binary code similarity comparison method</title><source>esp@cenet</source><creator>YASUTSUNE ; ZHANG JIANWEI ; FU XIUFENG ; JIA ZHANGTAO ; FENG DACHENG ; SHAO SA ; KONG XIANGBING ; LIU YUBO ; TAO JINLONG ; JIN YUCHUAN</creator><creatorcontrib>YASUTSUNE ; ZHANG JIANWEI ; FU XIUFENG ; JIA ZHANGTAO ; FENG DACHENG ; SHAO SA ; KONG XIANGBING ; LIU YUBO ; TAO JINLONG ; JIN YUCHUAN</creatorcontrib><description>The invention relates to a binary code similarity comparison method based on SimHash, and belongs to the field of code comparison. According to the method, the binary codes are disassembled, the assembly codes are preprocessed, the assembly codes are subjected to standardization processing, the SimHash values of the assembly codes are calculated, a code feature relation library framework is constructed, and the binary codes are rapidly positioned based on text similarity. The binary code similarity comparison method has the following advantages: the scheme provided by the invention can ensure the efficiency of binary code similarity comparison while giving consideration to the comparison efficiency; according to the method, the text comparison method based on SimHash is adopted, and the binary code similarity comparison efficiency can be improved. 本发明涉及一种基于SimHash的二进制代码相似性比对方法,属于代码比对领域。本发明对二进制代码反汇编及汇编代码预处理,对汇编代码标准化处理,计算汇编代码SimHash值,构建代码特征关系库构架,基于文本相似性的二进制代码快速定位。本发明具有以下优点:本发明提出的方案,能够在兼顾对比效率的同时,保证二进制代码相似性比对的效率;</description><language>chi ; eng</language><subject>CALCULATING ; COMPUTING ; COUNTING ; ELECTRIC DIGITAL DATA PROCESSING ; PHYSICS</subject><creationdate>2022</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20220902&amp;DB=EPODOC&amp;CC=CN&amp;NR=114995880A$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,777,882,25545,76296</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20220902&amp;DB=EPODOC&amp;CC=CN&amp;NR=114995880A$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>YASUTSUNE</creatorcontrib><creatorcontrib>ZHANG JIANWEI</creatorcontrib><creatorcontrib>FU XIUFENG</creatorcontrib><creatorcontrib>JIA ZHANGTAO</creatorcontrib><creatorcontrib>FENG DACHENG</creatorcontrib><creatorcontrib>SHAO SA</creatorcontrib><creatorcontrib>KONG XIANGBING</creatorcontrib><creatorcontrib>LIU YUBO</creatorcontrib><creatorcontrib>TAO JINLONG</creatorcontrib><creatorcontrib>JIN YUCHUAN</creatorcontrib><title>SimHash-based binary code similarity comparison method</title><description>The invention relates to a binary code similarity comparison method based on SimHash, and belongs to the field of code comparison. According to the method, the binary codes are disassembled, the assembly codes are preprocessed, the assembly codes are subjected to standardization processing, the SimHash values of the assembly codes are calculated, a code feature relation library framework is constructed, and the binary codes are rapidly positioned based on text similarity. The binary code similarity comparison method has the following advantages: the scheme provided by the invention can ensure the efficiency of binary code similarity comparison while giving consideration to the comparison efficiency; according to the method, the text comparison method based on SimHash is adopted, and the binary code similarity comparison efficiency can be improved. 本发明涉及一种基于SimHash的二进制代码相似性比对方法,属于代码比对领域。本发明对二进制代码反汇编及汇编代码预处理,对汇编代码标准化处理,计算汇编代码SimHash值,构建代码特征关系库构架,基于文本相似性的二进制代码快速定位。本发明具有以下优点:本发明提出的方案,能够在兼顾对比效率的同时,保证二进制代码相似性比对的效率;</description><subject>CALCULATING</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>ELECTRIC DIGITAL DATA PROCESSING</subject><subject>PHYSICS</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2022</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZDALzsz1SCzO0E1KLE5NUUjKzEssqlRIzk9JVSjOzM3MSSzKLAHxcwuArOL8PIXc1JKM_BQeBta0xJziVF4ozc2g6OYa4uyhm1qQH59aXJCYnJqXWhLv7GdoaGJpaWphYeBoTIwaAKp1LdM</recordid><startdate>20220902</startdate><enddate>20220902</enddate><creator>YASUTSUNE</creator><creator>ZHANG JIANWEI</creator><creator>FU XIUFENG</creator><creator>JIA ZHANGTAO</creator><creator>FENG DACHENG</creator><creator>SHAO SA</creator><creator>KONG XIANGBING</creator><creator>LIU YUBO</creator><creator>TAO JINLONG</creator><creator>JIN YUCHUAN</creator><scope>EVB</scope></search><sort><creationdate>20220902</creationdate><title>SimHash-based binary code similarity comparison method</title><author>YASUTSUNE ; ZHANG JIANWEI ; FU XIUFENG ; JIA ZHANGTAO ; FENG DACHENG ; SHAO SA ; KONG XIANGBING ; LIU YUBO ; TAO JINLONG ; JIN YUCHUAN</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_CN114995880A3</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>chi ; eng</language><creationdate>2022</creationdate><topic>CALCULATING</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>ELECTRIC DIGITAL DATA PROCESSING</topic><topic>PHYSICS</topic><toplevel>online_resources</toplevel><creatorcontrib>YASUTSUNE</creatorcontrib><creatorcontrib>ZHANG JIANWEI</creatorcontrib><creatorcontrib>FU XIUFENG</creatorcontrib><creatorcontrib>JIA ZHANGTAO</creatorcontrib><creatorcontrib>FENG DACHENG</creatorcontrib><creatorcontrib>SHAO SA</creatorcontrib><creatorcontrib>KONG XIANGBING</creatorcontrib><creatorcontrib>LIU YUBO</creatorcontrib><creatorcontrib>TAO JINLONG</creatorcontrib><creatorcontrib>JIN YUCHUAN</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>YASUTSUNE</au><au>ZHANG JIANWEI</au><au>FU XIUFENG</au><au>JIA ZHANGTAO</au><au>FENG DACHENG</au><au>SHAO SA</au><au>KONG XIANGBING</au><au>LIU YUBO</au><au>TAO JINLONG</au><au>JIN YUCHUAN</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>SimHash-based binary code similarity comparison method</title><date>2022-09-02</date><risdate>2022</risdate><abstract>The invention relates to a binary code similarity comparison method based on SimHash, and belongs to the field of code comparison. According to the method, the binary codes are disassembled, the assembly codes are preprocessed, the assembly codes are subjected to standardization processing, the SimHash values of the assembly codes are calculated, a code feature relation library framework is constructed, and the binary codes are rapidly positioned based on text similarity. The binary code similarity comparison method has the following advantages: the scheme provided by the invention can ensure the efficiency of binary code similarity comparison while giving consideration to the comparison efficiency; according to the method, the text comparison method based on SimHash is adopted, and the binary code similarity comparison efficiency can be improved. 本发明涉及一种基于SimHash的二进制代码相似性比对方法,属于代码比对领域。本发明对二进制代码反汇编及汇编代码预处理,对汇编代码标准化处理,计算汇编代码SimHash值,构建代码特征关系库构架,基于文本相似性的二进制代码快速定位。本发明具有以下优点:本发明提出的方案,能够在兼顾对比效率的同时,保证二进制代码相似性比对的效率;</abstract><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier
ispartof
issn
language chi ; eng
recordid cdi_epo_espacenet_CN114995880A
source esp@cenet
subjects CALCULATING
COMPUTING
COUNTING
ELECTRIC DIGITAL DATA PROCESSING
PHYSICS
title SimHash-based binary code similarity comparison method
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-18T14%3A45%3A30IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=YASUTSUNE&rft.date=2022-09-02&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3ECN114995880A%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true