Effects of Porting Essie Tokenization and Normalization to Solr
Searching for information is now an integral part of healthcare. Searches are enabled by search engines whose objective is to efficiently retrieve the relevant information for the user query. When it comes to retrieving biomedical text and literature, the Essie search engine developed at the National Libra...
Published in: | AMIA ... Annual Symposium proceedings 2023, Vol.2023, p.369-378 |
---|---|
Main authors: | Gayen, Soumya ; Gupta, Deepak ; F Loane, Russell ; Ide, Nicholas C ; Demner-Fushman, Dina |
Format: | Article |
Language: | eng |
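Subjects: | Benchmarking ; Humans ; Information Storage and Retrieval ; Internet ; National Library of Medicine (U.S.) ; Search Engine ; Software ; United States |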
Online access: | Full text |
container_end_page | 378 |
---|---|
container_issue | |
container_start_page | 369 |
container_title | AMIA ... Annual Symposium proceedings |
container_volume | 2023 |
creator | Gayen, Soumya ; Gupta, Deepak ; F Loane, Russell ; Ide, Nicholas C ; Demner-Fushman, Dina |
description | Searching for information is now an integral part of healthcare. Searches are enabled by search engines whose objective is to efficiently retrieve the relevant information for the user query. When it comes to retrieving biomedical text and literature, the Essie search engine developed at the National Library of Medicine (NLM) performs exceptionally well. However, Essie is a software system developed for NLM that has ceased development and support. On the other hand, Solr is a popular open-source enterprise search engine used by many of the world's largest internet sites, offering continuous development and improvements along with state-of-the-art features. In this paper, we present our approach to porting the key features of Essie and developing custom components to be used in Solr. We demonstrate the effectiveness of the added components on three benchmark biomedical datasets. The custom components may aid the community in improving search methods for biomedical text retrieval. |
format | Article |
fullrecord | <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_proquest_miscellaneous_2914255696</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2914255696</sourcerecordid><originalsourceid>FETCH-LOGICAL-p126t-40bad83bd0ee1cb40ef68b62e36e69e2c955a772a13b2d2688561df45085e2c93</originalsourceid><addsrcrecordid>eNo1j01LxDAYhIMg7rr6FyRHLwv5bnoSWeoHLCq4nkvSvJFo29QkPeivd8Xd08DMwzBzgpZUynotSKUW6DznD0JEJbU6QwuuGWOCkyW6abyHrmQcPX6JqYTxHTc5B8C7-Alj-DElxBGb0eGnmAbTH50S8Wvs0wU69abPcHnQFXq7a3abh_X2-f5xc7tdT5Spsh9hjdPcOgJAOysIeKWtYsAVqBpYV0tpqooZyi1zTGktFXVeSKLlX8pX6Pq_d0rxa4Zc2iHkDvrejBDn3LKaCialqtUevTqgsx3AtVMKg0nf7fE0_wXIflFz</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2914255696</pqid></control><display><type>article</type><title>Effects of Porting Essie Tokenization and Normalization to Solr</title><source>MEDLINE</source><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><source>PubMed Central</source><creator>Gayen, Soumya ; Gupta, Deepak ; F Loane, Russell ; Ide, Nicholas C ; Demner-Fushman, Dina</creator><creatorcontrib>Gayen, Soumya ; Gupta, Deepak ; F Loane, Russell ; Ide, Nicholas C ; Demner-Fushman, Dina</creatorcontrib><description>Search for information is now an integral part of healthcare. Searches are enabled by search engines whose objective is to efficiently retrieve the relevant information for the user query. When it comes to retrieving biomedical text and literature, Essie search engine developed at the National Library of Medicine (NLM) performs exceptionally well. However, Essie is a software system developed for NLM that has ceased development and support. On the other hand, Solr is a popular opensource enterprise search engine used by many of the world's largest internet sites, offering continuous developments and improvements along with the state-of-the-art features. 
In this paper, we present our approach to porting the key features of Essie and developing custom components to be used in Solr. We demonstrate the effectiveness of the added components on three benchmark biomedical datasets. The custom components may aid the community in improving search methods for biomedical text retrieval.</description><identifier>EISSN: 1559-4076</identifier><identifier>PMID: 38222430</identifier><language>eng</language><publisher>United States</publisher><subject>Benchmarking ; Humans ; Information Storage and Retrieval ; Internet ; National Library of Medicine (U.S.) ; Search Engine ; Software ; United States</subject><ispartof>AMIA ... Annual Symposium proceedings, 2023, Vol.2023, p.369-378</ispartof><rights>2023 AMIA - All rights reserved.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,4010</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/38222430$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Gayen, Soumya</creatorcontrib><creatorcontrib>Gupta, Deepak</creatorcontrib><creatorcontrib>F Loane, Russell</creatorcontrib><creatorcontrib>Ide, Nicholas C</creatorcontrib><creatorcontrib>Demner-Fushman, Dina</creatorcontrib><title>Effects of Porting Essie Tokenization and Normalization to Solr</title><title>AMIA ... Annual Symposium proceedings</title><addtitle>AMIA Annu Symp Proc</addtitle><description>Search for information is now an integral part of healthcare. Searches are enabled by search engines whose objective is to efficiently retrieve the relevant information for the user query. When it comes to retrieving biomedical text and literature, Essie search engine developed at the National Library of Medicine (NLM) performs exceptionally well. 
However, Essie is a software system developed for NLM that has ceased development and support. On the other hand, Solr is a popular opensource enterprise search engine used by many of the world's largest internet sites, offering continuous developments and improvements along with the state-of-the-art features. In this paper, we present our approach to porting the key features of Essie and developing custom components to be used in Solr. We demonstrate the effectiveness of the added components on three benchmark biomedical datasets. The custom components may aid the community in improving search methods for biomedical text retrieval.</description><subject>Benchmarking</subject><subject>Humans</subject><subject>Information Storage and Retrieval</subject><subject>Internet</subject><subject>National Library of Medicine (U.S.)</subject><subject>Search Engine</subject><subject>Software</subject><subject>United States</subject><issn>1559-4076</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNo1j01LxDAYhIMg7rr6FyRHLwv5bnoSWeoHLCq4nkvSvJFo29QkPeivd8Xd08DMwzBzgpZUynotSKUW6DznD0JEJbU6QwuuGWOCkyW6abyHrmQcPX6JqYTxHTc5B8C7-Alj-DElxBGb0eGnmAbTH50S8Wvs0wU69abPcHnQFXq7a3abh_X2-f5xc7tdT5Spsh9hjdPcOgJAOysIeKWtYsAVqBpYV0tpqooZyi1zTGktFXVeSKLlX8pX6Pq_d0rxa4Zc2iHkDvrejBDn3LKaCialqtUevTqgsx3AtVMKg0nf7fE0_wXIflFz</recordid><startdate>2023</startdate><enddate>2023</enddate><creator>Gayen, Soumya</creator><creator>Gupta, Deepak</creator><creator>F Loane, Russell</creator><creator>Ide, Nicholas C</creator><creator>Demner-Fushman, Dina</creator><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>7X8</scope></search><sort><creationdate>2023</creationdate><title>Effects of Porting Essie Tokenization and Normalization to Solr</title><author>Gayen, Soumya ; Gupta, Deepak ; F Loane, Russell ; Ide, Nicholas C ; Demner-Fushman, 
Dina</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-p126t-40bad83bd0ee1cb40ef68b62e36e69e2c955a772a13b2d2688561df45085e2c93</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Benchmarking</topic><topic>Humans</topic><topic>Information Storage and Retrieval</topic><topic>Internet</topic><topic>National Library of Medicine (U.S.)</topic><topic>Search Engine</topic><topic>Software</topic><topic>United States</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Gayen, Soumya</creatorcontrib><creatorcontrib>Gupta, Deepak</creatorcontrib><creatorcontrib>F Loane, Russell</creatorcontrib><creatorcontrib>Ide, Nicholas C</creatorcontrib><creatorcontrib>Demner-Fushman, Dina</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>MEDLINE - Academic</collection><jtitle>AMIA ... Annual Symposium proceedings</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Gayen, Soumya</au><au>Gupta, Deepak</au><au>F Loane, Russell</au><au>Ide, Nicholas C</au><au>Demner-Fushman, Dina</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Effects of Porting Essie Tokenization and Normalization to Solr</atitle><jtitle>AMIA ... Annual Symposium proceedings</jtitle><addtitle>AMIA Annu Symp Proc</addtitle><date>2023</date><risdate>2023</risdate><volume>2023</volume><spage>369</spage><epage>378</epage><pages>369-378</pages><eissn>1559-4076</eissn><abstract>Search for information is now an integral part of healthcare. Searches are enabled by search engines whose objective is to efficiently retrieve the relevant information for the user query. 
When it comes to retrieving biomedical text and literature, Essie search engine developed at the National Library of Medicine (NLM) performs exceptionally well. However, Essie is a software system developed for NLM that has ceased development and support. On the other hand, Solr is a popular opensource enterprise search engine used by many of the world's largest internet sites, offering continuous developments and improvements along with the state-of-the-art features. In this paper, we present our approach to porting the key features of Essie and developing custom components to be used in Solr. We demonstrate the effectiveness of the added components on three benchmark biomedical datasets. The custom components may aid the community in improving search methods for biomedical text retrieval.</abstract><cop>United States</cop><pmid>38222430</pmid><tpages>10</tpages></addata></record> |
fulltext | fulltext |
identifier | EISSN: 1559-4076 |
ispartof | AMIA ... Annual Symposium proceedings, 2023, Vol.2023, p.369-378 |
issn | 1559-4076 |
language | eng |
recordid | cdi_proquest_miscellaneous_2914255696 |
source | MEDLINE; Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals; PubMed Central |
subjects | Benchmarking ; Humans ; Information Storage and Retrieval ; Internet ; National Library of Medicine (U.S.) ; Search Engine ; Software ; United States |
title | Effects of Porting Essie Tokenization and Normalization to Solr |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-06T03%3A43%3A11IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Effects%20of%20Porting%20Essie%20Tokenization%20and%20Normalization%20to%20Solr&rft.jtitle=AMIA%20...%20Annual%20Symposium%20proceedings&rft.au=Gayen,%20Soumya&rft.date=2023&rft.volume=2023&rft.spage=369&rft.epage=378&rft.pages=369-378&rft.eissn=1559-4076&rft_id=info:doi/&rft_dat=%3Cproquest_pubme%3E2914255696%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2914255696&rft_id=info:pmid/38222430&rfr_iscdi=true |