Inductive inference for large scale text classification Kernel approaches and techniques

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Silva, Catarina (VerfasserIn), Ribeiro, Bernardete (VerfasserIn)
Format: Buch
Sprache:English
Veröffentlicht: Berlin [u.a.] Springer 2010
Schriftenreihe:Studies in computational intelligence 255
Online-Zugang:Inhaltsverzeichnis
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!

MARC

LEADER 00000nam a22000002cb4500
001 BV036524382
003 DE-604
005 00000000000000.0
007 t|
008 100625s2010 xx d||| |||| 00||| eng d
020 |a 9783642045325  |9 978-3-642-04532-5 
020 |z 9783642045332  |c eISBN  |9 978-3-642-04533-2 
035 |a (OCoLC)553585436 
035 |a (DE-599)BVBBV036524382 
040 |a DE-604  |b ger  |e rakwb 
041 0 |a eng 
049 |a DE-11 
082 0 |a 006.35  |2 22/ger 
084 |a ST 300  |0 (DE-625)143650:  |2 rvk 
100 1 |a Silva, Catarina  |e Verfasser  |4 aut 
245 1 0 |a Inductive inference for large scale text classification  |b Kernel approaches and techniques  |c Catarina Silva and Bernadete Ribeiro 
264 1 |a Berlin [u.a.]  |b Springer  |c 2010 
300 |a XX, 155 S.  |b graph. Darst. 
336 |b txt  |2 rdacontent 
337 |b n  |2 rdamedia 
338 |b nc  |2 rdacarrier 
490 1 |a Studies in computational intelligence  |v 255 
700 1 |a Ribeiro, Bernardete  |e Verfasser  |4 aut 
830 0 |a Studies in computational intelligence  |v 255  |w (DE-604)BV020822171  |9 255 
856 4 2 |m DNB Datenaustausch  |q application/pdf  |u http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=020446326&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA  |3 Inhaltsverzeichnis 
943 1 |a oai:aleph.bib-bvb.de:BVB01-020446326 

Datensatz im Suchindex

_version_ 1819659789699907584
adam_text CONTENTS PART I: FUNDAMENTALS 1 BACKGROUND ON TEXT CLASSIFICATION 3 1.1 PROBLEM SETTING 3 1.2 APPLICATIONS OF TEXT CLASSIFICATION 5 1.2.1 DOCUMENT ORGANIZATION 5 1.2.2 TEXT FILTERING 5 1.2.3 WORD SENSE DISAMBIGUATION 5 1.2.4 OTHER APPLICATIONS 6 1.3 DOCUMENT REPRESENTATION 6 1.4 PRE-PROCESSING TEXT 8 1.4.1 FEATURE SELECTION 8 1.4.2 FEATURE EXTRACTION 10 1.5 CLASSIFIERS 11 1.5.1 ROCCHIO S METHOD 12 1.5.2 DECISION TREES AND RULES 13 1.5.3 NAIVE BAYES 15 1.5.4 K-NEAREST NEIGHBOR 15 1.5.5 NEURAL NETWORKS 16 1.5.6 KERNEL-BASED LEARNING MACHINES 17 1.5.7 COMMITTEES 18 1.5.8 ACTIVE LEARNING 20 1.5.9 OTHER METHODS 21 1.6 EVALUATION 21 1.6.1 PERFORMANCE CRITERIA 22 1.6.2 DOCUMENT CORPORA 24 1.7 EVALUATION OF PRE-PROCESSING METHODS 26 1.8 CONCLUSION 29 BIBLIOGRAFISCHE INFORMATIONEN HTTP://D-NB.INFO/995947503 DIGITALISIERT DURCH 4.5 CONCLUSION 89 XIV CONTENTS 2 KERNEL MACHINES FOR TEXT CLASSIFICATION 31 2.1 KERNEL METHODS 31 2.2 SUPPORT VECTOR MACHINES 32 2.2.1 LINEAR HARD-MARGIN SVMS 33 2.2.2 SOFT-MARGIN SVMS 36 2.2.3 NONLINEAR SVMS 37 2.3 RELEVANCE VECTOR MACHINES 38 2.3.1 BAYESIAN APPROACHES 39 2.3.2 RVM APPROACH 40 2.4 BASELINE KERNEL MACHINES PERFORMANCES WITH BENCHMARK CORPORA 43 2.4.1 SVM PERFORMANCE 44 2.4.2 RVM PERFORMANCE 46 2.4.3 DISCUSSION 47 2.5 CONCLUSION 48 PART II: APPROACHES AND TECHNIQUES 3 ENHANCING SVMS FOR TEXT CLASSIFICATION 51 3.1 INCORPORATING UNLABELED DATA 51 3.1.1 BACKGROUND KNOWLEDGE AND ACTIVE LEARNING 53 3.1.2 EXPERIMENTAL RESULTS 56 3.1.3 COMBINING BACKGROUND KNOWLEDGE AND ACTIVE LEARNING 59 3.1.4 ANALYSIS OF RESULTS 60 3.2 USING MULTIPLE CLASSIFIERS 63 3.2.1 SVM ENSEMBLES 65 3.2.2 EXPERIMENTAL RESULTS AND ANALYSIS 66 3.3 CONCLUSION 69 4 SCALING RVMS FOR TEXT CLASSIFICATION 71 4.1 INTRODUCTION 71 4.2 SCALE REDUCTION APPROACHES 72 4.2.1 ACTIVE LEARNING 73 4.2.2 SIMILITUDE MEASURE 76 4.3 DIVIDE-AND-CONQUER APPROACHES 78 4.3.1 INCREMENTAL RVM 79 4.3.2 RVM BOOSTING 80 4.3.3 RVM ENSEMBLE 83 4.3.4 ANALYSIS OF RESULTS 84 4.4 HYBRID RVM-SVM APPROACH 86 CONTENTS XV 5 DISTRIBUTING TEXT CLASSIFICATION IN GRID ENVIRONMENTS ... 93 5.1 INTRODUCTION 93 5.2 RELATED WORK 94 5.2.1 DISTRIBUTED COMPUTING PLATFORMS 94 5.2.2 DISTRIBUTED APPLICATIONS 95 5.3 DEPLOYMENT IN THE DISTRIBUTED ENVIRONMENT 97 5.3.1 TASK SCHEDULING AND DIRECT ACYCLIC GRAPHS 97 5.3.2 DAG DESIGN IN A DISTRIBUTED ENVIRONMENT 97 5.3.3 DISTRIBUTED ENVIRONMENT FOR THE EXPERIMENTAL SETUP 100 5.3.4 MODEL OF THE ENVIRONMENT 100 5.4 DESIGN OF DISTRIBUTED TEXT CLASSIFICATION SCHEDULING SCHEMES 102 5.4.1 DATAFLOW IN TEXT CLASSIFICATION 102 5.4.2 OPTIMIZATION OF SCHEDULING SCHEMES 104 5.5 EXPERIMENTAL RESULTS 108 5.5.1 PROCESSING TIME 109 5.5.2 CLASSIFICATION PERFORMANCE 112 5.5.3 DISCUSSION OF RESULTS 114 5.6 CONCLUSION 115 6 FRAMEWORK FOR TEXT CLASSIFICATION 117 6.1 NOVEL TRENDS IN TEXT CLASSIFICATION 122 6.1.1 INFORMATION SEMANTICS 123 6.1.2 INFORMATION EXTRACTION 124 6.1.3 INFORMATION DISTRIBUTED SYSTEMS 126 6.2 CONCLUSION 127 A REUTERS-21578 129 A.I INTRODUCTION 129 A.2 HISTORY 129 A.3 FORMATTING 130 A.4 THE REUTERS TAG 130 A.5 DOCUMENT-INTERNAL TAGS 132 A.6 CATEGORIES 133 A.7 USING REUTERS-21578 FOR TEXT CATEGORIZATION RESEARCH 134 A.7.1 THE MODIFIED LEWIS ( MODLEWIS ) SPLIT 135 A. XVI CONTENTS B.3.1 TOPIC CODES 140 B.3.2 CODING POLICY 142 B.4 STOPWORDS 142 REFERENCES 143 INDEX 153
any_adam_object 1
author Silva, Catarina
Ribeiro, Bernardete
author_facet Silva, Catarina
Ribeiro, Bernardete
author_role aut
aut
author_sort Silva, Catarina
author_variant c s cs
b r br
building Verbundindex
bvnumber BV036524382
classification_rvk ST 300
ctrlnum (OCoLC)553585436
(DE-599)BVBBV036524382
dewey-full 006.35
dewey-hundreds 000 - Computer science, information, general works
dewey-ones 006 - Special computer methods
dewey-raw 006.35
dewey-search 006.35
dewey-sort 16.35
dewey-tens 000 - Computer science, information, general works
discipline Informatik
format Book
fullrecord <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>01361nam a22003372cb4500</leader><controlfield tag="001">BV036524382</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">00000000000000.0</controlfield><controlfield tag="007">t|</controlfield><controlfield tag="008">100625s2010 xx d||| |||| 00||| eng d</controlfield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9783642045325</subfield><subfield code="9">978-3-642-04532-5</subfield></datafield><datafield tag="020" ind1=" " ind2=" "><subfield code="z">9783642045332</subfield><subfield code="c">eISBN</subfield><subfield code="9">978-3-642-04533-2</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)553585436</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)BVBBV036524382</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield><subfield code="e">rakwb</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-11</subfield></datafield><datafield tag="082" ind1="0" ind2=" "><subfield code="a">006.35</subfield><subfield code="2">22/ger</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">ST 300</subfield><subfield code="0">(DE-625)143650:</subfield><subfield code="2">rvk</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Silva, Catarina</subfield><subfield code="e">Verfasser</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Inductive inference for large scale text classification</subfield><subfield code="b">Kernel approaches and techniques</subfield><subfield code="c">Catarina Silva and Bernadete Ribeiro</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Berlin [u.a.]</subfield><subfield code="b">Springer</subfield><subfield code="c">2010</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">XX, 155 S.</subfield><subfield code="b">graph. Darst.</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="490" ind1="1" ind2=" "><subfield code="a">Studies in computational intelligence</subfield><subfield code="v">255</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Ribeiro, Bernardete</subfield><subfield code="e">Verfasser</subfield><subfield code="4">aut</subfield></datafield><datafield tag="830" ind1=" " ind2="0"><subfield code="a">Studies in computational intelligence</subfield><subfield code="v">255</subfield><subfield code="w">(DE-604)BV020822171</subfield><subfield code="9">255</subfield></datafield><datafield tag="856" ind1="4" ind2="2"><subfield code="m">DNB Datenaustausch</subfield><subfield code="q">application/pdf</subfield><subfield code="u">http://bvbr.bib-bvb.de:8991/F?func=service&amp;doc_library=BVB01&amp;local_base=BVB01&amp;doc_number=020446326&amp;sequence=000001&amp;line_number=0001&amp;func_code=DB_RECORDS&amp;service_type=MEDIA</subfield><subfield code="3">Inhaltsverzeichnis</subfield></datafield><datafield tag="943" ind1="1" ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-020446326</subfield></datafield></record></collection>
id DE-604.BV036524382
illustrated Illustrated
indexdate 2024-12-24T00:06:25Z
institution BVB
isbn 9783642045325
language English
oai_aleph_id oai:aleph.bib-bvb.de:BVB01-020446326
oclc_num 553585436
open_access_boolean
owner DE-11
owner_facet DE-11
physical XX, 155 S. graph. Darst.
publishDate 2010
publishDateSearch 2010
publishDateSort 2010
publisher Springer
record_format marc
series Studies in computational intelligence
series2 Studies in computational intelligence
spellingShingle Silva, Catarina
Ribeiro, Bernardete
Inductive inference for large scale text classification Kernel approaches and techniques
Studies in computational intelligence
title Inductive inference for large scale text classification Kernel approaches and techniques
title_auth Inductive inference for large scale text classification Kernel approaches and techniques
title_exact_search Inductive inference for large scale text classification Kernel approaches and techniques
title_full Inductive inference for large scale text classification Kernel approaches and techniques Catarina Silva and Bernadete Ribeiro
title_fullStr Inductive inference for large scale text classification Kernel approaches and techniques Catarina Silva and Bernadete Ribeiro
title_full_unstemmed Inductive inference for large scale text classification Kernel approaches and techniques Catarina Silva and Bernadete Ribeiro
title_short Inductive inference for large scale text classification
title_sort inductive inference for large scale text classification kernel approaches and techniques
title_sub Kernel approaches and techniques
url http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=020446326&sequence=000001&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA
volume_link (DE-604)BV020822171
work_keys_str_mv AT silvacatarina inductiveinferenceforlargescaletextclassificationkernelapproachesandtechniques
AT ribeirobernardete inductiveinferenceforlargescaletextclassificationkernelapproachesandtechniques