Effects of Porting Essie Tokenization and Normalization to Solr

Searching for information is now an integral part of healthcare. Search engines enable this by efficiently retrieving the information relevant to a user query. For retrieving biomedical text and literature, the Essie search engine developed at the National Libra...

Detailed description

Bibliographic details
Published in: AMIA ... Annual Symposium proceedings 2023, Vol. 2023, p. 369-378
Main authors: Gayen, Soumya, Gupta, Deepak, F Loane, Russell, Ide, Nicholas C, Demner-Fushman, Dina
Format: Article
Language: eng
Subjects:
Online access: Full text
container_end_page 378
container_issue
container_start_page 369
container_title AMIA ... Annual Symposium proceedings
container_volume 2023
creator Gayen, Soumya
Gupta, Deepak
F Loane, Russell
Ide, Nicholas C
Demner-Fushman, Dina
description Searching for information is now an integral part of healthcare. Search engines enable this by efficiently retrieving the information relevant to a user query. For retrieving biomedical text and literature, the Essie search engine developed at the National Library of Medicine (NLM) performs exceptionally well. However, Essie is a software system developed for NLM whose development and support have ceased. Solr, on the other hand, is a popular open-source enterprise search engine used by many of the world's largest internet sites, offering continuous development and improvement along with state-of-the-art features. In this paper, we present our approach to porting the key features of Essie and developing custom components for use in Solr. We demonstrate the effectiveness of the added components on three benchmark biomedical datasets. The custom components may aid the community in improving search methods for biomedical text retrieval.
format Article
fulltext fulltext
identifier EISSN: 1559-4076
ispartof AMIA ... Annual Symposium proceedings, 2023, Vol.2023, p.369-378
issn 1559-4076
language eng
recordid cdi_proquest_miscellaneous_2914255696
source MEDLINE; Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals; PubMed Central
subjects Benchmarking
Humans
Information Storage and Retrieval
Internet
National Library of Medicine (U.S.)
Search Engine
Software
United States
title Effects of Porting Essie Tokenization and Normalization to Solr
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-06T03%3A43%3A11IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Effects%20of%20Porting%20Essie%20Tokenization%20and%20Normalization%20to%20Solr&rft.jtitle=AMIA%20...%20Annual%20Symposium%20proceedings&rft.au=Gayen,%20Soumya&rft.date=2023&rft.volume=2023&rft.spage=369&rft.epage=378&rft.pages=369-378&rft.eissn=1559-4076&rft_id=info:doi/&rft_dat=%3Cproquest_pubme%3E2914255696%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2914255696&rft_id=info:pmid/38222430&rfr_iscdi=true