Filtering Template driven spam mails using Vector Space models
Spam became a big problem to the society. Some spammers are using templates for sending spam. To send a particular promotion they create some template and merge the details of receivers with the template. Similarities can find among these mails and easily ignore the forthcoming spam. Most high-volum...
Gespeichert in:
Veröffentlicht in: | International journal of computer applications 2012-01, Vol.39 (14), p.33-35 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 35 |
---|---|
container_issue | 14 |
container_start_page | 33 |
container_title | International journal of computer applications |
container_volume | 39 |
creator | Varghese, Liny M.H, Supriya Poulose Jacob, K. |
description | Spam became a big problem to the society. Some spammers are using templates for sending spam. To send a particular promotion they create some template and merge the details of receivers with the template. Similarities can find among these mails and easily ignore the forthcoming spam. Most high-volume spam is sent using tools those randomizes parts of the message - subject, body, sender address etc. The general form of the template that the spammer is using can often guess by inspecting the features of messages. Most of the spam filters are either rule based models or Bayesian models. The main objective in this paper is to find out semantic distance and evaluate the applicability of the two information retrieval techniques, Simple Vector Space Models (VSM) and VSM using Rocchio Classification in the spam context. Both methods are using cosine similarities to identify the spam |
doi_str_mv | 10.5120/4891-7383 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_1031306952</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2659295621</sourcerecordid><originalsourceid>FETCH-LOGICAL-c1303-fa72196567160eaea1f89b20763c37067a5c27ebcaf7687c0e72731ab035599b3</originalsourceid><addsrcrecordid>eNpd0EtLAzEQB_AgCpbag98g4EUPq3k0r4sgxapQ8GD1GrLprGzJPkx2Bb99s9SDOJeZw4-Z4Y_QJSW3gjJyt9SGFoprfoJmxChRaK3V6Z_5HC1S2pNc3DBpljN0v67DALFuP_EWmj64AfAu1t_Q4tS7BjeuDgmPaQIf4Icu4rfeecBNt4OQLtBZ5UKCxW-fo_f143b1XGxen15WD5vCU054UTnFqJFCKioJOHC00qZkREnuuSJSOeGZgtK7SkmtPAHFFKeuJFwIY0o-R9fHvX3svkZIg23q5CEE10I3JksJz4ekESzTq390342xzd9lRVn-YUl1VjdH5WOXUoTK9rFuXPzJyE5h2ilMO4XJD6EvY30</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1012567418</pqid></control><display><type>article</type><title>Filtering Template driven spam mails using Vector Space models</title><source>EZB-FREE-00999 freely available EZB journals</source><creator>Varghese, Liny ; M.H, Supriya ; Poulose Jacob, K.</creator><creatorcontrib>Varghese, Liny ; M.H, Supriya ; Poulose Jacob, K.</creatorcontrib><description>Spam became a big problem to the society. Some spammers are using templates for sending spam. To send a particular promotion they create some template and merge the details of receivers with the template. Similarities can find among these mails and easily ignore the forthcoming spam. Most high-volume spam is sent using tools those randomizes parts of the message - subject, body, sender address etc. The general form of the template that the spammer is using can often guess by inspecting the features of messages. Most of the spam filters are either rule based models or Bayesian models. The main objective in this paper is to find out semantic distance and evaluate the applicability of the two information retrieval techniques, Simple Vector Space Models (VSM) and VSM using Rocchio Classification in the spam context. Both methods are using cosine similarities to identify the spam</description><identifier>ISSN: 0975-8887</identifier><identifier>EISSN: 0975-8887</identifier><identifier>DOI: 10.5120/4891-7383</identifier><language>eng</language><publisher>New York: Foundation of Computer Science</publisher><subject>Analogies ; Filtering ; Mail ; Mathematical models ; Messages ; Rule based ; Spamming ; Vector spaces</subject><ispartof>International journal of computer applications, 2012-01, Vol.39 (14), p.33-35</ispartof><rights>Copyright Foundation of Computer Science 2012</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27915,27916</link.rule.ids></links><search><creatorcontrib>Varghese, Liny</creatorcontrib><creatorcontrib>M.H, Supriya</creatorcontrib><creatorcontrib>Poulose Jacob, K.</creatorcontrib><title>Filtering Template driven spam mails using Vector Space models</title><title>International journal of computer applications</title><description>Spam became a big problem to the society. Some spammers are using templates for sending spam. To send a particular promotion they create some template and merge the details of receivers with the template. Similarities can find among these mails and easily ignore the forthcoming spam. Most high-volume spam is sent using tools those randomizes parts of the message - subject, body, sender address etc. The general form of the template that the spammer is using can often guess by inspecting the features of messages. Most of the spam filters are either rule based models or Bayesian models. The main objective in this paper is to find out semantic distance and evaluate the applicability of the two information retrieval techniques, Simple Vector Space Models (VSM) and VSM using Rocchio Classification in the spam context. Both methods are using cosine similarities to identify the spam</description><subject>Analogies</subject><subject>Filtering</subject><subject>Mail</subject><subject>Mathematical models</subject><subject>Messages</subject><subject>Rule based</subject><subject>Spamming</subject><subject>Vector spaces</subject><issn>0975-8887</issn><issn>0975-8887</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2012</creationdate><recordtype>article</recordtype><recordid>eNpd0EtLAzEQB_AgCpbag98g4EUPq3k0r4sgxapQ8GD1GrLprGzJPkx2Bb99s9SDOJeZw4-Z4Y_QJSW3gjJyt9SGFoprfoJmxChRaK3V6Z_5HC1S2pNc3DBpljN0v67DALFuP_EWmj64AfAu1t_Q4tS7BjeuDgmPaQIf4Icu4rfeecBNt4OQLtBZ5UKCxW-fo_f143b1XGxen15WD5vCU054UTnFqJFCKioJOHC00qZkREnuuSJSOeGZgtK7SkmtPAHFFKeuJFwIY0o-R9fHvX3svkZIg23q5CEE10I3JksJz4ekESzTq390342xzd9lRVn-YUl1VjdH5WOXUoTK9rFuXPzJyE5h2ilMO4XJD6EvY30</recordid><startdate>20120101</startdate><enddate>20120101</enddate><creator>Varghese, Liny</creator><creator>M.H, Supriya</creator><creator>Poulose Jacob, K.</creator><general>Foundation of Computer Science</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20120101</creationdate><title>Filtering Template driven spam mails using Vector Space models</title><author>Varghese, Liny ; M.H, Supriya ; Poulose Jacob, K.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c1303-fa72196567160eaea1f89b20763c37067a5c27ebcaf7687c0e72731ab035599b3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2012</creationdate><topic>Analogies</topic><topic>Filtering</topic><topic>Mail</topic><topic>Mathematical models</topic><topic>Messages</topic><topic>Rule based</topic><topic>Spamming</topic><topic>Vector spaces</topic><toplevel>online_resources</toplevel><creatorcontrib>Varghese, Liny</creatorcontrib><creatorcontrib>M.H, Supriya</creatorcontrib><creatorcontrib>Poulose Jacob, K.</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>International journal of computer applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Varghese, Liny</au><au>M.H, Supriya</au><au>Poulose Jacob, K.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Filtering Template driven spam mails using Vector Space models</atitle><jtitle>International journal of computer applications</jtitle><date>2012-01-01</date><risdate>2012</risdate><volume>39</volume><issue>14</issue><spage>33</spage><epage>35</epage><pages>33-35</pages><issn>0975-8887</issn><eissn>0975-8887</eissn><abstract>Spam became a big problem to the society. Some spammers are using templates for sending spam. To send a particular promotion they create some template and merge the details of receivers with the template. Similarities can find among these mails and easily ignore the forthcoming spam. Most high-volume spam is sent using tools those randomizes parts of the message - subject, body, sender address etc. The general form of the template that the spammer is using can often guess by inspecting the features of messages. Most of the spam filters are either rule based models or Bayesian models. The main objective in this paper is to find out semantic distance and evaluate the applicability of the two information retrieval techniques, Simple Vector Space Models (VSM) and VSM using Rocchio Classification in the spam context. Both methods are using cosine similarities to identify the spam</abstract><cop>New York</cop><pub>Foundation of Computer Science</pub><doi>10.5120/4891-7383</doi><tpages>3</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0975-8887 |
ispartof | International journal of computer applications, 2012-01, Vol.39 (14), p.33-35 |
issn | 0975-8887 0975-8887 |
language | eng |
recordid | cdi_proquest_miscellaneous_1031306952 |
source | EZB-FREE-00999 freely available EZB journals |
subjects | Analogies Filtering Mathematical models Messages Rule based Spamming Vector spaces |
title | Filtering Template driven spam mails using Vector Space models |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-15T00%3A36%3A07IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Filtering%20Template%20driven%20spam%20mails%20using%20Vector%20Space%20models&rft.jtitle=International%20journal%20of%20computer%20applications&rft.au=Varghese,%20Liny&rft.date=2012-01-01&rft.volume=39&rft.issue=14&rft.spage=33&rft.epage=35&rft.pages=33-35&rft.issn=0975-8887&rft.eissn=0975-8887&rft_id=info:doi/10.5120/4891-7383&rft_dat=%3Cproquest_cross%3E2659295621%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1012567418&rft_id=info:pmid/&rfr_iscdi=true |