Filtering Template driven spam mails using Vector Space models

Spam has become a major problem for society. Some spammers use templates to send spam: to send a particular promotion, they create a template and merge the recipients' details into it. Similarities can be found among these mails, so forthcoming spam can easily be ignored. Most high-volum...

Bibliographic details
Published in: International journal of computer applications 2012-01, Vol.39 (14), p.33-35
Main authors: Varghese, Liny; M.H, Supriya; Poulose Jacob, K.
Format: Article
Language: eng
Online access: Full text
container_end_page 35
container_issue 14
container_start_page 33
container_title International journal of computer applications
container_volume 39
creator Varghese, Liny
M.H, Supriya
Poulose Jacob, K.
description Spam has become a major problem for society. Some spammers use templates to send spam: to send a particular promotion, they create a template and merge the recipients' details into it. Similarities can be found among these mails, so forthcoming spam can easily be ignored. Most high-volume spam is sent using tools that randomize parts of the message: subject, body, sender address, etc. The general form of the template a spammer is using can often be guessed by inspecting the features of the messages. Most spam filters are either rule-based models or Bayesian models. The main objective of this paper is to determine semantic distance and evaluate the applicability of two information retrieval techniques, simple Vector Space Models (VSM) and VSM using Rocchio classification, in the spam context. Both methods use cosine similarity to identify spam.
doi_str_mv 10.5120/4891-7383
format Article
publisher New York: Foundation of Computer Science
rights Copyright Foundation of Computer Science 2012
eissn 0975-8887
tpages 3
oa free_for_read
fulltext fulltext
identifier ISSN: 0975-8887
ispartof International journal of computer applications, 2012-01, Vol.39 (14), p.33-35
issn 0975-8887
0975-8887
language eng
recordid cdi_proquest_miscellaneous_1031306952
source EZB-FREE-00999 freely available EZB journals
subjects Analogies
Filtering
Mail
Mathematical models
Messages
Rule based
Spamming
Vector spaces
title Filtering Template driven spam mails using Vector Space models
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-15T00%3A36%3A07IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Filtering%20Template%20driven%20spam%20mails%20using%20Vector%20Space%20models&rft.jtitle=International%20journal%20of%20computer%20applications&rft.au=Varghese,%20Liny&rft.date=2012-01-01&rft.volume=39&rft.issue=14&rft.spage=33&rft.epage=35&rft.pages=33-35&rft.issn=0975-8887&rft.eissn=0975-8887&rft_id=info:doi/10.5120/4891-7383&rft_dat=%3Cproquest_cross%3E2659295621%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1012567418&rft_id=info:pmid/&rfr_iscdi=true
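
The description in this record names two techniques, a simple vector space model and a VSM with Rocchio classification, both relying on cosine similarity. The following is only a minimal sketch of how such a comparison might look, not the authors' implementation. It assumes raw term-frequency vectors, a whitespace tokenizer, a similarity threshold of 0.8, and the common centroid form of Rocchio classification; the function names and sample messages are all hypothetical.

```python
import math
from collections import Counter


def vectorize(text):
    """Bag-of-words term-frequency vector (assumed whitespace tokenizer)."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse term -> weight mappings."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def centroid(vectors):
    """Mean vector of a class; used as the Rocchio class prototype."""
    total = Counter()
    for v in vectors:
        total.update(v)
    n = len(vectors)
    return {t: c / n for t, c in total.items()}


def simple_vsm_is_spam(message, known_spam, threshold=0.8):
    """Simple VSM: flag a message that is close to any known spam mail.

    The threshold is an assumed value, not one taken from the paper.
    """
    v = vectorize(message)
    return any(cosine(v, vectorize(s)) >= threshold for s in known_spam)


def rocchio_is_spam(message, spam_examples, ham_examples):
    """Rocchio (centroid) classification: pick the closer class centroid."""
    v = vectorize(message)
    spam_c = centroid([vectorize(s) for s in spam_examples])
    ham_c = centroid([vectorize(h) for h in ham_examples])
    return cosine(v, spam_c) > cosine(v, ham_c)


# Hypothetical template-driven spam: only the recipient name varies.
SPAM = [
    "dear alice you won a free prize claim now",
    "dear bob you won a free prize claim now",
]
HAM = [
    "meeting moved to friday please confirm",
    "lunch tomorrow at noon with the team",
]
```

With these sample messages, `simple_vsm_is_spam("dear carol you won a free prize claim now", SPAM)` flags the new mail because only the merged recipient name differs from the stored template instances. The Rocchio variant compares against one centroid per class instead of every stored message, which keeps the number of comparisons constant as the training set grows.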