Morphology based text compression
With the rapid growth of online information, the number of documents in electronic media is very common increased. Easy and quick access to this information gets more important for the purpose of text compression. In recent years, a portion of the work in the field of text compression covers study a...
Gespeichert in:
Hauptverfasser: | , |
---|---|
Format: | Tagungsbericht |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 48 |
---|---|
container_issue | |
container_start_page | 45 |
container_title | |
container_volume | |
creator | Göksu, Hayriye Diri, B |
description | With the rapid growth of online information, the number of documents in electronic media is very common increased. Easy and quick access to this information gets more important for the purpose of text compression. In recent years, a portion of the work in the field of text compression covers study aimed to the morphological structure of the language. In this study, Turkish and English documents are compressed in the determination of the different decomposition methods and efficiency, this method has been to investigate the effects of compression. Turkish and English documents are parsed by using morphological structure. The next stage in the parsed document structure is applied to the compression process with Huffman compression method. As a result, created 10 different parsing techniques with which attempts were made on a different corpus. |
doi_str_mv | 10.1109/SIU.2010.5651231 |
format | Conference Proceeding |
fullrecord | <record><control><sourceid>ieee_6IE</sourceid><recordid>TN_cdi_ieee_primary_5651231</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>5651231</ieee_id><sourcerecordid>5651231</sourcerecordid><originalsourceid>FETCH-ieee_primary_56512313</originalsourceid><addsrcrecordid>eNp9zj0PgjAUheHrVyIqu4kL_gCwt4ULnY1GByd1JqhVMWBJyyD_XgZcnU7ePMsBmCMLEJlcHffngLO2IoqQC-yBK-MEQx6GkmKkPjicpPAFIQ1g8gOOwxaQIp8RS8bgWvtijCElXEjuwPKgTfXUhX403iWz6ubV6lN7V11WRlmb6_cMRvessMrtdgqL7ea03vm5UiqtTF5mpkm7V-K_fgGJIjPR</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Morphology based text compression</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Göksu, Hayriye ; Diri, B</creator><creatorcontrib>Göksu, Hayriye ; Diri, B</creatorcontrib><description>With the rapid growth of online information, the number of documents in electronic media is very common increased. Easy and quick access to this information gets more important for the purpose of text compression. In recent years, a portion of the work in the field of text compression covers study aimed to the morphological structure of the language. In this study, Turkish and English documents are compressed in the determination of the different decomposition methods and efficiency, this method has been to investigate the effects of compression. Turkish and English documents are parsed by using morphological structure. The next stage in the parsed document structure is applied to the compression process with Huffman compression method. As a result, created 10 different parsing techniques with which attempts were made on a different corpus.</description><identifier>ISSN: 2165-0608</identifier><identifier>ISBN: 1424496721</identifier><identifier>ISBN: 9781424496723</identifier><identifier>EISSN: 2693-3616</identifier><identifier>EISBN: 9781424496716</identifier><identifier>EISBN: 1424496713</identifier><identifier>EISBN: 9781424496709</identifier><identifier>EISBN: 1424496705</identifier><identifier>DOI: 10.1109/SIU.2010.5651231</identifier><language>eng</language><publisher>IEEE</publisher><subject>Computers ; Conferences ; Data compression ; Entropy ; Information technology ; Markov processes ; Morphology</subject><ispartof>2010 IEEE 18th Signal Processing and Communications Applications Conference, 2010, p.45-48</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/5651231$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,776,780,785,786,2051,27904,54898</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/5651231$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Göksu, Hayriye</creatorcontrib><creatorcontrib>Diri, B</creatorcontrib><title>Morphology based text compression</title><title>2010 IEEE 18th Signal Processing and Communications Applications Conference</title><addtitle>SIU</addtitle><description>With the rapid growth of online information, the number of documents in electronic media is very common increased. Easy and quick access to this information gets more important for the purpose of text compression. In recent years, a portion of the work in the field of text compression covers study aimed to the morphological structure of the language. In this study, Turkish and English documents are compressed in the determination of the different decomposition methods and efficiency, this method has been to investigate the effects of compression. Turkish and English documents are parsed by using morphological structure. The next stage in the parsed document structure is applied to the compression process with Huffman compression method. As a result, created 10 different parsing techniques with which attempts were made on a different corpus.</description><subject>Computers</subject><subject>Conferences</subject><subject>Data compression</subject><subject>Entropy</subject><subject>Information technology</subject><subject>Markov processes</subject><subject>Morphology</subject><issn>2165-0608</issn><issn>2693-3616</issn><isbn>1424496721</isbn><isbn>9781424496723</isbn><isbn>9781424496716</isbn><isbn>1424496713</isbn><isbn>9781424496709</isbn><isbn>1424496705</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2010</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><sourceid>RIE</sourceid><recordid>eNp9zj0PgjAUheHrVyIqu4kL_gCwt4ULnY1GByd1JqhVMWBJyyD_XgZcnU7ePMsBmCMLEJlcHffngLO2IoqQC-yBK-MEQx6GkmKkPjicpPAFIQ1g8gOOwxaQIp8RS8bgWvtijCElXEjuwPKgTfXUhX403iWz6ubV6lN7V11WRlmb6_cMRvessMrtdgqL7ea03vm5UiqtTF5mpkm7V-K_fgGJIjPR</recordid><startdate>201004</startdate><enddate>201004</enddate><creator>Göksu, Hayriye</creator><creator>Diri, B</creator><general>IEEE</general><scope>6IE</scope><scope>6IL</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIL</scope></search><sort><creationdate>201004</creationdate><title>Morphology based text compression</title><author>Göksu, Hayriye ; Diri, B</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-ieee_primary_56512313</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2010</creationdate><topic>Computers</topic><topic>Conferences</topic><topic>Data compression</topic><topic>Entropy</topic><topic>Information technology</topic><topic>Markov processes</topic><topic>Morphology</topic><toplevel>online_resources</toplevel><creatorcontrib>Göksu, Hayriye</creatorcontrib><creatorcontrib>Diri, B</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library (IEL)</collection><collection>IEEE Proceedings Order Plans (POP All) 1998-Present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Göksu, Hayriye</au><au>Diri, B</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Morphology based text compression</atitle><btitle>2010 IEEE 18th Signal Processing and Communications Applications Conference</btitle><stitle>SIU</stitle><date>2010-04</date><risdate>2010</risdate><spage>45</spage><epage>48</epage><pages>45-48</pages><issn>2165-0608</issn><eissn>2693-3616</eissn><isbn>1424496721</isbn><isbn>9781424496723</isbn><eisbn>9781424496716</eisbn><eisbn>1424496713</eisbn><eisbn>9781424496709</eisbn><eisbn>1424496705</eisbn><abstract>With the rapid growth of online information, the number of documents in electronic media is very common increased. Easy and quick access to this information gets more important for the purpose of text compression. In recent years, a portion of the work in the field of text compression covers study aimed to the morphological structure of the language. In this study, Turkish and English documents are compressed in the determination of the different decomposition methods and efficiency, this method has been to investigate the effects of compression. Turkish and English documents are parsed by using morphological structure. The next stage in the parsed document structure is applied to the compression process with Huffman compression method. As a result, created 10 different parsing techniques with which attempts were made on a different corpus.</abstract><pub>IEEE</pub><doi>10.1109/SIU.2010.5651231</doi></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | ISSN: 2165-0608 |
ispartof | 2010 IEEE 18th Signal Processing and Communications Applications Conference, 2010, p.45-48 |
issn | 2165-0608 2693-3616 |
language | eng |
recordid | cdi_ieee_primary_5651231 |
source | IEEE Electronic Library (IEL) Conference Proceedings |
subjects | Computers Conferences Data compression Entropy Information technology Markov processes Morphology |
title | Morphology based text compression |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-23T23%3A27%3A06IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Morphology%20based%20text%20compression&rft.btitle=2010%20IEEE%2018th%20Signal%20Processing%20and%20Communications%20Applications%20Conference&rft.au=Go%CC%88ksu,%20Hayriye&rft.date=2010-04&rft.spage=45&rft.epage=48&rft.pages=45-48&rft.issn=2165-0608&rft.eissn=2693-3616&rft.isbn=1424496721&rft.isbn_list=9781424496723&rft_id=info:doi/10.1109/SIU.2010.5651231&rft_dat=%3Cieee_6IE%3E5651231%3C/ieee_6IE%3E%3Curl%3E%3C/url%3E&rft.eisbn=9781424496716&rft.eisbn_list=1424496713&rft.eisbn_list=9781424496709&rft.eisbn_list=1424496705&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=5651231&rfr_iscdi=true |