Skip to Content

464 Results Found

  • Article
  • Open Access
4 Citations
2,871 Views
15 Pages

10 June 2022

Traditional mathematical search models retrieve scientific documents only by mathematical expressions and their contexts and do not consider the ontological attributes of scientific documents, which result in gaps between the queries and the retrieva...

  • Article
  • Open Access
9 Citations
4,873 Views
19 Pages

The rapid increase in scientific publications has made it challenging to keep up with the latest advancements. Conducting systematic reviews using traditional methods is both time-consuming and difficult. To address this, new review formats like rapi...

  • Proceeding Paper
  • Open Access

Transformer-based models face persistent challenges in long-document summarization due to fixed input-length constraints. Hybrid approaches address this limitation by applying extractive preprocessing to select salient sentences for downstream abstra...

  • Article
  • Open Access
1 Citations
3,404 Views
10 Pages

27 October 2020

Recently, the performance of machine-reading and comprehension (MRC) systems has been significantly enhanced. However, MRC systems require high-performance text retrieval models because text passages containing answer phrases should be prepared in ad...

  • Article
  • Open Access
19 Citations
5,097 Views
13 Pages

An Integrated Graph Model for Document Summarization

  • Kang Yang,
  • Kamal Al-Sabahi,
  • Yanmin Xiang and
  • Zuping Zhang

13 September 2018

Extractive summarization aims to produce a concise version of a document by extracting information-rich sentences from the original texts. The graph-based model is an effective and efficient approach to rank sentences since it is simple and easy to u...

  • Article
  • Open Access
9 Citations
4,800 Views
20 Pages

Multi-Document News Web Page Summarization Using Content Extraction and Lexical Chain Based Key Phrase Extraction

  • Chandrakala Arya,
  • Manoj Diwakar,
  • Prabhishek Singh,
  • Vijendra Singh,
  • Seifedine Kadry and
  • Jungeun Kim

7 April 2023

In the area of text summarization, there have been significant advances recently. In the meantime, the current trend in text summarization is focused more on news summarization. Therefore, developing a synthesis approach capable of extracting, compar...

  • Article
  • Open Access
1,907 Views
28 Pages

Handwritten Keyword Spotting (KWS) remains a challenging task, particularly in segmentation-free scenarios where word images must be retrieved and ranked based on their similarity to a query without relying on prior page-level segmentation. Tradition...

  • Article
  • Open Access
8 Citations
3,984 Views
22 Pages

Lightweight, Secure, Similar-Document Retrieval over Encrypted Data

  • Mustafa A. Al Sibahee,
  • Ayad I. Abdulsada,
  • Zaid Ameen Abduljabbar,
  • Junchao Ma,
  • Vincent Omollo Nyangaresi and
  • Samir M. Umran

17 December 2021

Applications for document similarity detection are widespread in diverse communities, including institutions and corporations. However, currently available detection systems fail to take into account the private nature of material or documents that h...

  • Article
  • Open Access
1 Citations
4,411 Views
21 Pages

23 November 2025

The rapid growth of digital journalism has heightened the need for reliable multi-document summarization (MDS) systems, particularly in underrepresented, low-resource, and culturally distinct contexts. However, current progress is hindered by a lack...

  • Article
  • Open Access
1 Citations
2,400 Views
14 Pages

5 December 2021

With the advent of cloud computing, the low-cost and high-capacity cloud storages have attracted people to move their data from local computers to the remote facilities. People can access and share their data with others at anytime, from anywhere. Ho...

  • Article
  • Open Access
2 Citations
2,453 Views
25 Pages

Learning to Co-Embed Queries and Documents

  • Yuehong Wu,
  • Bowen Lu,
  • Lin Tian and
  • Shangsong Liang

11 November 2022

Learning to Rank (L2R) methods that utilize machine learning techniques to solve the ranking problems have been widely studied in the field of information retrieval. Existing methods usually concatenate query and document features as training input,...

  • Article
  • Open Access
2 Citations
4,465 Views
20 Pages

3 September 2020

Evaluation of document classification is straightforward if complete information on the documents’ true categories exists. In this case, the rank of each document can be accurately determined and evaluated. However, in an unsupervised setting,...

  • Article
  • Open Access
2 Citations
4,489 Views
16 Pages

Exploring the Importance of Entities in Semantic Ranking

  • Zhenyang Li,
  • Guangluan Xu,
  • Xiao Liang,
  • Feng Li,
  • Lei Wang and
  • Daobing Zhang

24 January 2019

In recent years, entity-based ranking models have led to exciting breakthroughs in the research of information retrieval. Compared with traditional retrieval models, entity-based representation enables a better understanding of queries and documents....

  • Article
  • Open Access
1 Citations
2,954 Views
16 Pages

16 November 2022

This paper proposes a new methodology to study sequential corpora by implementing a two-stage algorithm that learns time-based topics with respect to a scale of document positions and introduces the concept of Topic Scaling, which ranks learned topic...

  • Article
  • Open Access
1,196 Views
14 Pages

3 November 2024

In the area of consumer health search (CHS), there is an increasing concern about returning topically relevant and understandable health information to the user. Besides being used to rank topically relevant documents, Learning to Rank (LTR) has also...

  • Article
  • Open Access
9 Citations
4,264 Views
25 Pages

29 April 2024

In this paper, we construct an academic literature knowledge graph based on the relationship between documents to facilitate the storage and research of academic literature data. Keywords are an important type of node in the knowledge graph. To solve...

  • Article
  • Open Access
3 Citations
6,456 Views
21 Pages

7 April 2025

Similar patent document retrieval is an essential task that reduces the scope of patent claimants’ searches, and numerous studies have attempted to provide automated patent search services. Recently, Retrieval-Augmented Generation (RAG) based o...

  • Article
  • Open Access
2 Citations
2,284 Views
13 Pages

7 October 2023

Automatic Keyphrase Extraction involves identifying essential phrases in a document. These keyphrases are crucial in various tasks, such as document classification, clustering, recommendation, indexing, searching, summarization, and text simplificati...

  • Article
  • Open Access
17 Citations
8,093 Views
23 Pages

Climate change puts pressure on existing health vulnerabilities through higher frequency of extreme weather events, changes in disease vector distribution or exacerbated air pollution. Climate change adaptation policies may hold potential to reduce s...

  • Article
  • Open Access
6 Citations
4,468 Views
17 Pages

Topic Models Ensembles for AD-HOC Information Retrieval

  • Pablo Ormeño,
  • Marcelo Mendoza and
  • Carlos Valle

1 September 2021

Ad hoc information retrieval (ad hoc IR) is a challenging task consisting of ranking text documents for bag-of-words (BOW) queries. Classic approaches based on query and document text vectors use term-weighting functions to rank the documents. Some o...

  • Article
  • Open Access
46 Citations
16,550 Views
17 Pages

Language Bias in the Google Scholar Ranking Algorithm

  • Cristòfol Rovira,
  • Lluís Codina and
  • Carlos Lopezosa

27 January 2021

The visibility of academic articles or conference papers depends on their being easily found in academic search engines, above all in Google Scholar. To enhance this visibility, search engine optimization (SEO) has been applied in recent years to aca...

  • Article
  • Open Access
2 Citations
3,137 Views
21 Pages

A DFT-Based Running Time Prediction Algorithm for Web Queries

  • Oscar Rojas,
  • Veronica Gil-Costa and
  • Mauricio Marin

4 August 2021

Web search engines are built from components capable of processing large amounts of user queries per second in a distributed way. Among them, the index service computes the top-k documents that best match each incoming query by means of a document ra...

  • Article
  • Open Access
7 Citations
3,052 Views
12 Pages

30 March 2020

The rapid growth of Internet technologies has led to an enormous increase in the number of electronic documents used worldwide. To organize and manage big data for unstructured documents effectively and efficiently, text categorization has been emplo...

  • Article
  • Open Access
13 Citations
3,486 Views
16 Pages

1 March 2019

Research front detection and topic evolution has for a long time been an important direction for research in the informetrics field. However, most previous studies either simply use a citation count for scientific document clustering or assume that e...

  • Article
  • Open Access
5 Citations
2,863 Views
20 Pages

Quantum Approach for Contextual Search, Retrieval, and Ranking of Classical Information

  • Alexander P. Alodjants,
  • Anna E. Avdyushina,
  • Dmitriy V. Tsarev,
  • Igor A. Bessmertny and
  • Andrey Yu. Khrennikov

13 October 2024

Quantum-inspired algorithms represent an important direction in modern software information technologies that use heuristic methods and approaches of quantum science. This work presents a quantum approach for document search, retrieval, and ranking b...

  • Article
  • Open Access
27 Citations
6,960 Views
21 Pages

2 July 2020

This study proposes the optimization method of the associative knowledge graph using TF-IDF based ranking scores. The proposed method calculates TF-IDF weights in all documents and generates term ranking. Based on the terms with high scores from TF-I...

  • Article
  • Open Access
6 Citations
4,023 Views
16 Pages

AR Search Engine: Semantic Information Retrieval for Augmented Reality Domain

  • Maryam Shakeri,
  • Abolghasem Sadeghi-Niaraki,
  • Soo-Mi Choi and
  • Tamer AbuHmed

25 November 2022

With the emergence of the metaverse, the popularity of augmented reality (AR) is increasing; accessing concise, accurate, and precise information in this field is becoming challenging on the world wide web. In regard to accessing the right informatio...

  • Article
  • Open Access
3 Citations
3,846 Views
18 Pages

20 March 2024

In this paper, we describe our biomedical document retrieval system and answers extraction module, which is part of the biomedical question answering system. Approximately 26.5 million PubMed articles are indexed as a corpus with the Apache Lucene te...

  • Article
  • Open Access
3 Citations
4,331 Views
28 Pages

14 September 2023

Over the past decade, knowledge bases (KB) have been increasingly utilized to complete and enrich the representation of queries and documents in order to improve the document retrieval task. Although many approaches have used KB for such purposes, th...

  • Proceeding Paper
  • Open Access
2,290 Views
3 Pages

Building High-Quality Datasets for Information Retrieval Evaluation at a Reduced Cost

  • David Otero,
  • Daniel Valcarce,
  • Javier Parapar and
  • Álvaro Barreiro

Information Retrieval is not any more exclusively about document ranking. Continuously new tasks are proposed on this and sibling fields. With this proliferation of tasks, it becomes crucial to have a cheap way of constructing test collections to eva...

  • Review
  • Open Access
14 Citations
15,953 Views
20 Pages

A Bibliometric Analysis of Objective and Subjective Risk

  • Haitham Nobanee,
  • Maryam Alhajjar,
  • Mohammed Ahmed Alkaabi,
  • Majed Musabah Almemari,
  • Mohamed Abdulla Alhassani,
  • Naema Khamis Alkaabi,
  • Saeed Abdulla Alshamsi and
  • Hanan Hamed AlBlooshi

4 July 2021

In relation to “objective risk” or “subjective risk”, a bibliometric analysis was performed using documents found in the Scopus database. A search for related documents was narrowed down to 192 documents and these were considered in this study. The r...

  • Article
  • Open Access
3 Citations
3,692 Views
15 Pages

To support evidence-based precision medicine and clinical decision-making, we need to identify accurate, appropriate, and clinically relevant studies from voluminous biomedical literature. To address the issue of accurate identification of high impac...

  • Article
  • Open Access
16 Citations
9,210 Views
20 Pages

From Geoportals to Geographic Knowledge Portals

  • Bernhard Vockner,
  • Andreas Richter and
  • Manfred Mittlböck

We present the application of Latent Semantic Analysis (LSA) in combination with recommender systems, in order to enhance discovery in geoportals. As a basis for discovery, metadata of spatial data and services, as well as of non-spatial resources, s...

  • Article
  • Open Access
4 Citations
3,526 Views
14 Pages

Document Recommendations and Feedback Collection Analysis within the Slovenian Open-Access Infrastructure

  • Mladen Borovič,
  • Marko Ferme,
  • Janez Brezovnik,
  • Sandi Majninger,
  • Klemen Kac and
  • Milan Ojsteršek

23 October 2020

This paper presents a hybrid document recommender system intended for use in digital libraries and institutional repositories that are part of the Slovenian Open Access Infrastructure. The recommender system provides recommendations of similar docume...

  • Article
  • Open Access
10 Citations
4,598 Views
16 Pages

18 January 2021

Automatic extractive text summarization retrieves a subset of data that represents most notable sentences in the entire document. In the era of digital explosion, which is mostly unstructured textual data, there is a demand for users to understand th...

  • Article
  • Open Access
1 Citations
2,171 Views
24 Pages

16 March 2024

Keyphrase extraction is a critical task in text information retrieval, which traditionally employs both supervised and unsupervised approaches. Supervised methods generally rely on large corpora, which introduce the problems of availability, while un...

  • Article
  • Open Access
5 Citations
3,871 Views
19 Pages

Forestry Research in the Middle East: A Bibliometric Analysis

  • Mohsen Fazeli-Varzaneh,
  • Pete Bettinger,
  • Erfan Ghaderi-Azad,
  • Marcin Kozak,
  • Davood Mafi-Gholami and
  • Abolfazl Jaafari

23 July 2021

Research trends in the field of forestry have experienced a significant evolution in recent years. However, there has been little use of bibliometric analyses to assess academic organizations and individual researchers in this field of science. This...

  • Article
  • Open Access
32 Citations
7,488 Views
27 Pages

The Influence of Feature Representation of Text on the Performance of Document Classification

  • Sanda Martinčić-Ipšić,
  • Tanja Miličić and
  • Ljupčo Todorovski

20 February 2019

In this paper we perform a comparative analysis of three models for a feature representation of text documents in the context of document classification. In particular, we consider the most often used family of bag-of-words models, the recently propo...

  • Article
  • Open Access
3 Citations
5,958 Views
16 Pages

In the recent big data era, massive spatial related data are continuously generated and scrambled from various sources. Acquiring accurate geographic information is also urgently demanded. How to accurately retrieve desired geographic information has...

  • Article
  • Open Access
1 Citations
5,976 Views
27 Pages

Urban documents like city planning reports and environmental data often feature complex charts and texts that require effective summarization tools, particularly in smart city management systems. These documents increasingly use graphical abstracts a...

  • Article
  • Open Access
2 Citations
3,110 Views
16 Pages

Multi-Layer Contextual Passage Term Embedding for Ad-Hoc Retrieval

  • Weihong Cai,
  • Zijun Hu,
  • Yalan Luo,
  • Daoyuan Liang,
  • Yifan Feng and
  • Jiaxin Chen

25 April 2022

Nowadays, pre-trained language models such as Bidirectional Encoder Representations from Transformer (BERT) are becoming a basic building block in Information Retrieval tasks. Nevertheless, there are several limitations when applying BERT to the quer...

  • Article
  • Open Access
4 Citations
3,941 Views
19 Pages

Diagnostic Evaluation of Policy-Gradient-Based Ranking

  • Hai-Tao Yu,
  • Degen Huang,
  • Fuji Ren and
  • Lishuang Li

Learning-to-rank has been intensively studied and has shown significantly increasing values in a wide range of domains, such as web search, recommender systems, dialogue systems, machine translation, and even computational biology, to name a few. In...

  • Article
  • Open Access
3 Citations
2,048 Views
18 Pages

Lightweight and Privacy-Preserving Multi-Keyword Search over Outsourced Data

  • Meng Zhao,
  • Lingang Liu,
  • Yong Ding,
  • Hua Deng,
  • Hai Liang,
  • Huiyong Wang and
  • Yujue Wang

22 February 2023

In cloud computing, documents can be outsourced to the cloud server to achieve flexible access control and efficient sharing among multiple users. The outsourced documents can be intelligently searched according to some keywords with the help of clou...

  • Feature Paper
  • Article
  • Open Access
8 Citations
4,504 Views
21 Pages

5 September 2021

Tools for Natural Language Processing work using linguistic resources, that are language-specific. The complexity of building such resources causes many languages to lack them. So, learning them automatically from sample texts would be a desirable so...

  • Article
  • Open Access
1 Citations
2,455 Views
15 Pages

Children and young people constitute a structurally vulnerable group who often experience specific barriers when trying to exercise their rights, including the right to health. The aim of this study was to examine core concepts of human rights and in...

  • Article
  • Open Access
10 Citations
6,266 Views
27 Pages

A Framework for Content-Based Search in Large Music Collections

  • Tiange Zhu,
  • Raphaël Fournier-S’niehotta,
  • Philippe Rigaux and
  • Nicolas Travers

We address the problem of scalable content-based search in large collections of music documents. Music content is highly complex and versatile and presents multiple facets that can be considered independently or in combination. Moreover, music docume...

  • Review
  • Open Access
93 Citations
8,866 Views
35 Pages

3 June 2022

This paper aims to comprehensively review 891 documents in the Scopus database about Internet of Things (IoT) in Ind 4.0 to understand the historical growth, current state, and potential expansion trend. From 2014 to 2020, a systematic methodology ga...

  • Article
  • Open Access
18 Citations
4,272 Views
14 Pages

12 April 2018

Document classification has a broad application in the field of sentiment classification, document ranking and topic labeling, etc. Previous neural network-based work has mainly focused on investigating a so-called forward implication, i.e., the prec...

  • Data Descriptor
  • Open Access
3 Citations
4,161 Views
11 Pages

27 September 2020

In this article, we introduce a dataset of curated learning paths (LPs) to support search as learning. LPs were obtained through an online survey delivered to experts in different domains. Data were then analyzed and described in terms of a set of va...

  • Article
  • Open Access
16 Citations
3,331 Views
20 Pages

14 November 2023

The building industry is one of the most resource-intensive sectors in industrialized countries, requiring a shift from a linear to a more sustainable circular economic model. Nevertheless, there are several major challenges, such as the management o...

of 10