Morphology based text compression

With the rapid growth of online information, the number of documents in electronic media is very common increased. Easy and quick access to this information gets more important for the purpose of text compression. In recent years, a portion of the work in the field of text compression covers study a...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Göksu, Hayriye, Diri, B
Format: Tagungsbericht
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 48
container_issue
container_start_page 45
container_title
container_volume
creator Göksu, Hayriye
Diri, B
description With the rapid growth of online information, the number of documents in electronic media is very common increased. Easy and quick access to this information gets more important for the purpose of text compression. In recent years, a portion of the work in the field of text compression covers study aimed to the morphological structure of the language. In this study, Turkish and English documents are compressed in the determination of the different decomposition methods and efficiency, this method has been to investigate the effects of compression. Turkish and English documents are parsed by using morphological structure. The next stage in the parsed document structure is applied to the compression process with Huffman compression method. As a result, created 10 different parsing techniques with which attempts were made on a different corpus.
doi_str_mv 10.1109/SIU.2010.5651231
format Conference Proceeding
fullrecord <record><control><sourceid>ieee_6IE</sourceid><recordid>TN_cdi_ieee_primary_5651231</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>5651231</ieee_id><sourcerecordid>5651231</sourcerecordid><originalsourceid>FETCH-ieee_primary_56512313</originalsourceid><addsrcrecordid>eNp9zj0PgjAUheHrVyIqu4kL_gCwt4ULnY1GByd1JqhVMWBJyyD_XgZcnU7ePMsBmCMLEJlcHffngLO2IoqQC-yBK-MEQx6GkmKkPjicpPAFIQ1g8gOOwxaQIp8RS8bgWvtijCElXEjuwPKgTfXUhX403iWz6ubV6lN7V11WRlmb6_cMRvessMrtdgqL7ea03vm5UiqtTF5mpkm7V-K_fgGJIjPR</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Morphology based text compression</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Göksu, Hayriye ; Diri, B</creator><creatorcontrib>Göksu, Hayriye ; Diri, B</creatorcontrib><description>With the rapid growth of online information, the number of documents in electronic media is very common increased. Easy and quick access to this information gets more important for the purpose of text compression. In recent years, a portion of the work in the field of text compression covers study aimed to the morphological structure of the language. In this study, Turkish and English documents are compressed in the determination of the different decomposition methods and efficiency, this method has been to investigate the effects of compression. Turkish and English documents are parsed by using morphological structure. The next stage in the parsed document structure is applied to the compression process with Huffman compression method. As a result, created 10 different parsing techniques with which attempts were made on a different corpus.</description><identifier>ISSN: 2165-0608</identifier><identifier>ISBN: 1424496721</identifier><identifier>ISBN: 9781424496723</identifier><identifier>EISSN: 2693-3616</identifier><identifier>EISBN: 9781424496716</identifier><identifier>EISBN: 1424496713</identifier><identifier>EISBN: 9781424496709</identifier><identifier>EISBN: 1424496705</identifier><identifier>DOI: 10.1109/SIU.2010.5651231</identifier><language>eng</language><publisher>IEEE</publisher><subject>Computers ; Conferences ; Data compression ; Entropy ; Information technology ; Markov processes ; Morphology</subject><ispartof>2010 IEEE 18th Signal Processing and Communications Applications Conference, 2010, p.45-48</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/5651231$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,776,780,785,786,2051,27904,54898</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/5651231$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Göksu, Hayriye</creatorcontrib><creatorcontrib>Diri, B</creatorcontrib><title>Morphology based text compression</title><title>2010 IEEE 18th Signal Processing and Communications Applications Conference</title><addtitle>SIU</addtitle><description>With the rapid growth of online information, the number of documents in electronic media is very common increased. Easy and quick access to this information gets more important for the purpose of text compression. In recent years, a portion of the work in the field of text compression covers study aimed to the morphological structure of the language. In this study, Turkish and English documents are compressed in the determination of the different decomposition methods and efficiency, this method has been to investigate the effects of compression. Turkish and English documents are parsed by using morphological structure. The next stage in the parsed document structure is applied to the compression process with Huffman compression method. As a result, created 10 different parsing techniques with which attempts were made on a different corpus.</description><subject>Computers</subject><subject>Conferences</subject><subject>Data compression</subject><subject>Entropy</subject><subject>Information technology</subject><subject>Markov processes</subject><subject>Morphology</subject><issn>2165-0608</issn><issn>2693-3616</issn><isbn>1424496721</isbn><isbn>9781424496723</isbn><isbn>9781424496716</isbn><isbn>1424496713</isbn><isbn>9781424496709</isbn><isbn>1424496705</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2010</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><sourceid>RIE</sourceid><recordid>eNp9zj0PgjAUheHrVyIqu4kL_gCwt4ULnY1GByd1JqhVMWBJyyD_XgZcnU7ePMsBmCMLEJlcHffngLO2IoqQC-yBK-MEQx6GkmKkPjicpPAFIQ1g8gOOwxaQIp8RS8bgWvtijCElXEjuwPKgTfXUhX403iWz6ubV6lN7V11WRlmb6_cMRvessMrtdgqL7ea03vm5UiqtTF5mpkm7V-K_fgGJIjPR</recordid><startdate>201004</startdate><enddate>201004</enddate><creator>Göksu, Hayriye</creator><creator>Diri, B</creator><general>IEEE</general><scope>6IE</scope><scope>6IL</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIL</scope></search><sort><creationdate>201004</creationdate><title>Morphology based text compression</title><author>Göksu, Hayriye ; Diri, B</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-ieee_primary_56512313</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2010</creationdate><topic>Computers</topic><topic>Conferences</topic><topic>Data compression</topic><topic>Entropy</topic><topic>Information technology</topic><topic>Markov processes</topic><topic>Morphology</topic><toplevel>online_resources</toplevel><creatorcontrib>Göksu, Hayriye</creatorcontrib><creatorcontrib>Diri, B</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library (IEL)</collection><collection>IEEE Proceedings Order Plans (POP All) 1998-Present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Göksu, Hayriye</au><au>Diri, B</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Morphology based text compression</atitle><btitle>2010 IEEE 18th Signal Processing and Communications Applications Conference</btitle><stitle>SIU</stitle><date>2010-04</date><risdate>2010</risdate><spage>45</spage><epage>48</epage><pages>45-48</pages><issn>2165-0608</issn><eissn>2693-3616</eissn><isbn>1424496721</isbn><isbn>9781424496723</isbn><eisbn>9781424496716</eisbn><eisbn>1424496713</eisbn><eisbn>9781424496709</eisbn><eisbn>1424496705</eisbn><abstract>With the rapid growth of online information, the number of documents in electronic media is very common increased. Easy and quick access to this information gets more important for the purpose of text compression. In recent years, a portion of the work in the field of text compression covers study aimed to the morphological structure of the language. In this study, Turkish and English documents are compressed in the determination of the different decomposition methods and efficiency, this method has been to investigate the effects of compression. Turkish and English documents are parsed by using morphological structure. The next stage in the parsed document structure is applied to the compression process with Huffman compression method. As a result, created 10 different parsing techniques with which attempts were made on a different corpus.</abstract><pub>IEEE</pub><doi>10.1109/SIU.2010.5651231</doi></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 2165-0608
ispartof 2010 IEEE 18th Signal Processing and Communications Applications Conference, 2010, p.45-48
issn 2165-0608
2693-3616
language eng
recordid cdi_ieee_primary_5651231
source IEEE Electronic Library (IEL) Conference Proceedings
subjects Computers
Conferences
Data compression
Entropy
Information technology
Markov processes
Morphology
title Morphology based text compression
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-23T23%3A27%3A06IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Morphology%20based%20text%20compression&rft.btitle=2010%20IEEE%2018th%20Signal%20Processing%20and%20Communications%20Applications%20Conference&rft.au=Go%CC%88ksu,%20Hayriye&rft.date=2010-04&rft.spage=45&rft.epage=48&rft.pages=45-48&rft.issn=2165-0608&rft.eissn=2693-3616&rft.isbn=1424496721&rft.isbn_list=9781424496723&rft_id=info:doi/10.1109/SIU.2010.5651231&rft_dat=%3Cieee_6IE%3E5651231%3C/ieee_6IE%3E%3Curl%3E%3C/url%3E&rft.eisbn=9781424496716&rft.eisbn_list=1424496713&rft.eisbn_list=9781424496709&rft.eisbn_list=1424496705&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=5651231&rfr_iscdi=true