Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

Article Types

Countries / Regions

Search Results (13)

Search Parameters:
Keywords = Monero

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
24 pages, 10501 KB  
Article
Unveiling Dark Web Identity Patterns: A Network-Based Analysis of Identification Types and Communication Channels in Illicit Activities
by Luis de-Marcos, Adrián Domínguez-Díaz, Javier Junquera-Sánchez, Carlos Cilleruelo and José-Javier Martínez-Herráiz
Information 2025, 16(11), 924; https://doi.org/10.3390/info16110924 - 22 Oct 2025
Cited by 1 | Viewed by 2211
Abstract
The Dark Web, a hidden segment of the internet, has become a hub for illicit activities, facilitated by various forms of digital identification (IDs) such as email addresses, Telegram accounts, and cryptocurrency wallets. This study conducts a comprehensive analysis of the Dark Web’s [...] Read more.
The Dark Web, a hidden segment of the internet, has become a hub for illicit activities, facilitated by various forms of digital identification (IDs) such as email addresses, Telegram accounts, and cryptocurrency wallets. This study conducts a comprehensive analysis of the Dark Web’s identification and communication patterns, focusing on the roles of different ID types and their associated activities. Using a dataset of Dark Web documents, we construct and analyze a bipartite network to model the relationships between IDs and web documents, employing graph–theoretical metrics such as degree centrality, closeness centrality, betweenness centrality, and k-core decomposition, while analyzing subnetworks formed by ID type. Our findings reveal that Telegram forms the backbone of the network, serving as the primary communication tool for hacking-related activities, particularly within Russian-speaking communities. In contrast, email plays a more decentralized role, facilitating finance–crypto and other activities but with a high level of fragmentation and English as the predominant language. XMR (Monero) wallets emerge as a key component in financial transactions, forming a cohesive subnetwork focused on cryptocurrency-related activities. The analysis also highlights the modular and hierarchical nature of the Dark Web, with distinct clusters for hacking, finance–crypto, and drugs–narcotics, often operating independently but with some cross-topic interactions. This study provides a foundation for understanding the Dark Web’s structure and dynamics, offering insights that can inform strategies for monitoring and mitigating its risks. Full article
(This article belongs to the Section Information Security and Privacy)
Show Figures

Graphical abstract

15 pages, 1149 KB  
Article
Paraoxonase-1 as a Cardiovascular Biomarker in Caribbean Hispanic Patients Treated with Clopidogrel: Abundance and Functionality
by Mariangeli Monero-Paredes, Ednalise Santiago, Kelvin Carrasquillo-Carrion, Jessicca Y. Renta, Igor B. Rogozin, Abiel Roche-Lima and Jorge Duconge
Int. J. Mol. Sci. 2024, 25(19), 10657; https://doi.org/10.3390/ijms251910657 - 3 Oct 2024
Cited by 1 | Viewed by 1724
Abstract
Clopidogrel, a prescription drug to reduce ischemic events in cardiovascular patients, has been extensively studied in mostly European individuals but not among Caribbean Hispanics. This study evaluated the low abundance and reduced activity of paraoxonase-1 (PON1) in clopidogrel-resistant patients as a predictive risk [...] Read more.
Clopidogrel, a prescription drug to reduce ischemic events in cardiovascular patients, has been extensively studied in mostly European individuals but not among Caribbean Hispanics. This study evaluated the low abundance and reduced activity of paraoxonase-1 (PON1) in clopidogrel-resistant patients as a predictive risk biomarker of poor responders and disease severity in this population. Thirty-six patients on clopidogrel (cases divided into poor and normal responders) were enrolled, along with 11 cardiovascular patients with no clopidogrel indications (positive control) and 13 healthy volunteers (negative control). Residual on-treatment platelet reactivity unit (PRU), PON1 abundance by Western blotting, and PON1 activity by enzymatic assays were measured. PON1 genotyping and computational haplotype phasing were performed on 512 DNA specimens for two genetic loci (rs662 and rs854560). No statistical differences in mean relative PON1 abundance were found among the groups (p > 0.05). However, a significantly lower enzymatic activity was found in poor responders (10.57 ± 6.79 µU/mL) when compared to controls (22.66 ± 8.30 µU/mL and 22.21 ± 9.66 µU/mL; p = 0.004). PON1 activity among carriers of the most prevalent PON1 haplotype (AA|AA) was significantly lower than in wild types (7.90 µU/mL vs. 22.03 µU/mL; p = 0.005). Our findings suggested that PON1 is a potential biomarker of cardiovascular disease severity in Caribbean Hispanics. Full article
(This article belongs to the Special Issue Biomarkers for the Diagnosis and Prognosis of Cardiovascular Disease)
Show Figures

Figure 1

17 pages, 1266 KB  
Article
Non-Random Enrichment of Single-Nucleotide Polymorphisms Associated with Clopidogrel Resistance within Risk Loci Linked to the Severity of Underlying Cardiovascular Diseases: The Role of Admixture
by Mariangeli Monero-Paredes, Roberto Feliu-Maldonado, Kelvin Carrasquillo-Carrion, Pablo Gonzalez, Igor B. Rogozin, Abiel Roche-Lima and Jorge Duconge
Genes 2023, 14(9), 1813; https://doi.org/10.3390/genes14091813 - 17 Sep 2023
Viewed by 2955
Abstract
Cardiovascular disease (CVD) is one of the leading causes of death in Puerto Rico, where clopidogrel is commonly prescribed to prevent ischemic events. Genetic contributors to both a poor clopidogrel response and the severity of CVD have been identified mainly in Europeans. However, [...] Read more.
Cardiovascular disease (CVD) is one of the leading causes of death in Puerto Rico, where clopidogrel is commonly prescribed to prevent ischemic events. Genetic contributors to both a poor clopidogrel response and the severity of CVD have been identified mainly in Europeans. However, the non-random enrichment of single-nucleotide polymorphisms (SNPs) associated with clopidogrel resistance within risk loci linked to underlying CVDs, and the role of admixture, have yet to be tested. This study aimed to assess the possible interaction between genetic biomarkers linked to CVDs and those associated with clopidogrel resistance among admixed Caribbean Hispanics. We identified 50 SNPs significantly associated with CVDs in previous genome-wide association studies (GWASs). These SNPs were combined with another ten SNPs related to clopidogrel resistance in Caribbean Hispanics. We developed Python scripts to determine whether SNPs related to CVDs are in close proximity to those associated with the clopidogrel response. The average and individual local ancestry (LAI) within each locus were inferred, and 60 random SNPs with their corresponding LAIs were generated for enrichment estimation purposes. Our results showed no CVD-linked SNPs in close proximity to those associated with the clopidogrel response among Caribbean Hispanics. Consequently, no genetic loci with a dual predictive role for the risk of CVD severity and clopidogrel resistance were found in this population. Native American ancestry was the most enriched within the risk loci linked to CVDs in this population. The non-random enrichment of disease susceptibility loci with drug-response SNPs is a new frontier in Precision Medicine that needs further attention. Full article
Show Figures

Graphical abstract

15 pages, 2164 KB  
Article
Sedation with Sevoflurane versus Propofol in COVID-19 Patients with Acute Respiratory Distress Syndrome: Results from a Randomized Clinical Trial
by Sara Martínez-Castro, Berta Monleón, Jaume Puig, Carolina Ferrer Gomez, Marta Quesada, David Pestaña, Alberto Balvis, Emilio Maseda, Alejandro Suárez de la Rica, Ana Monero Feijoo and Rafael Badenes
J. Pers. Med. 2023, 13(6), 925; https://doi.org/10.3390/jpm13060925 - 31 May 2023
Cited by 7 | Viewed by 3369
Abstract
Background: Acute respiratory distress syndrome (ARDS) related to COVID-19 (coronavirus disease 2019) led to intensive care units (ICUs) collapse. Amalgams of sedative agents (including volatile anesthetics) were used due to the clinical shortage of intravenous drugs (mainly propofol and midazolam). Methods: A multicenter, [...] Read more.
Background: Acute respiratory distress syndrome (ARDS) related to COVID-19 (coronavirus disease 2019) led to intensive care units (ICUs) collapse. Amalgams of sedative agents (including volatile anesthetics) were used due to the clinical shortage of intravenous drugs (mainly propofol and midazolam). Methods: A multicenter, randomized 1:1, controlled clinical trial was designed to compare sedation using propofol and sevoflurane in patients with ARDS associated with COVID-19 infection in terms of oxygenation and mortality. Results: Data from a total of 17 patients (10 in the propofol arm and 7 in the sevoflurane arm) showed a trend toward PaO2/FiO2 improvement and the sevoflurane arm’s superiority in decreasing the likelihood of death (no statistical significance was found). Conclusions: Intravenous agents are the most-used sedative agents in Spain, even though volatile anesthetics, such as sevoflurane and isoflurane, have shown beneficial effects in many clinical conditions. Growing evidence demonstrates the safety and potential benefits of using volatile anesthetics in critical situations. Full article
(This article belongs to the Special Issue New Paradigms in Anesthesia and Intensive Care)
Show Figures

Figure 1

13 pages, 2671 KB  
Article
A Forecasting Approach to Cryptocurrency Price Index Using Reinforcement Learning
by L. Thanga Mariappan, J. Arun Pandian, V. Dhilip Kumar, Oana Geman, Iuliana Chiuchisan and Carmen Năstase
Appl. Sci. 2023, 13(4), 2692; https://doi.org/10.3390/app13042692 - 19 Feb 2023
Cited by 9 | Viewed by 6590
Abstract
Cryptocurrency has emerged as a well-known significant component with both economic and financial potential in recent years. Unfortunately, Bitcoin acquisition is not simple, due to uneven business and significant rate fluctuations. Traditional approaches to price forecasting have proven incapable of proving adequate data [...] Read more.
Cryptocurrency has emerged as a well-known significant component with both economic and financial potential in recent years. Unfortunately, Bitcoin acquisition is not simple, due to uneven business and significant rate fluctuations. Traditional approaches to price forecasting have proven incapable of proving adequate data and solutions because prices can now be forecast in real time. We recommended a machine learning-based alternative for a mortgage lender based on highlighted problems in forecasting the price of Bitcoin. The proposed system included a reinforcement learning algorithm for price estimation and forecasting, as well as a blockchain framework for an efficient and secure environment. The proposed prediction, compared to other state-of-the-art strategies in this sector, demonstrated better performance. In this system, the proposed prediction reached improved consistency, in comparison to other systems, with respect to Monero (XMR), Litecoin (LTC), Oryen (ORY), and Bitcoin (BTC). Full article
(This article belongs to the Section Computing and Artificial Intelligence)
Show Figures

Figure 1

15 pages, 1384 KB  
Article
A Real-Time Hybrid Approach to Combat In-Browser Cryptojacking Malware
by Muhammad Haris Khan Abbasi, Subhan Ullah, Tahir Ahmad and Attaullah Buriro
Appl. Sci. 2023, 13(4), 2039; https://doi.org/10.3390/app13042039 - 4 Feb 2023
Cited by 12 | Viewed by 4986
Abstract
Cryptojacking is a type of computer piracy in which a hacker uses a victim’s computer resources, without their knowledge or consent, to mine for cryptocurrency. This is made possible by new memory-based cryptomining techniques and the growth of new web technologies such as [...] Read more.
Cryptojacking is a type of computer piracy in which a hacker uses a victim’s computer resources, without their knowledge or consent, to mine for cryptocurrency. This is made possible by new memory-based cryptomining techniques and the growth of new web technologies such as WebAssembly, allowing mining to occur within a browser. Most of the research in the field of cryptojacking has focused on detection methods rather than prevention methods. Some of the detection methods proposed in the literature include using static and dynamic features of in-browser cryptojacking malware, along with machine learning algorithms such as Support Vector Machine (SVM), Random Forest (RF), and others. However, these methods can be effective in detecting known cryptojacking malware, but they may not be able to detect new or unknown variants. The existing prevention methods are shown to be effective only against web-assembly (WASM)-based cryptojacking malware and cannot handle mining service-providing scripts that use non-WASM modules. This paper proposes a novel hybrid approach for detecting and preventing web-based cryptojacking. The proposed approach performs the real-time detection and prevention of in-browser cryptojacking malware, using the blacklisting technique and statistical code analysis to identify unique features of non-WASM cryptojacking malware. The experimental results show positive performances in the ease of use and efficiency, with the detection accuracy improved from 97% to 99.6%. Moreover, the time required to prevent already known malware in real time can be decreased by 99.8%. Full article
(This article belongs to the Special Issue Information Security and Privacy)
Show Figures

Figure 1

14 pages, 446 KB  
Article
On Forecasting Cryptocurrency Prices: A Comparison of Machine Learning, Deep Learning, and Ensembles
by Kate Murray, Andrea Rossi, Diego Carraro and Andrea Visentin
Forecasting 2023, 5(1), 196-209; https://doi.org/10.3390/forecast5010010 - 29 Jan 2023
Cited by 69 | Viewed by 26465
Abstract
Traders and investors are interested in accurately predicting cryptocurrency prices to increase returns and minimize risk. However, due to their uncertainty, volatility, and dynamism, forecasting crypto prices is a challenging time series analysis task. Researchers have proposed predictors based on statistical, machine learning [...] Read more.
Traders and investors are interested in accurately predicting cryptocurrency prices to increase returns and minimize risk. However, due to their uncertainty, volatility, and dynamism, forecasting crypto prices is a challenging time series analysis task. Researchers have proposed predictors based on statistical, machine learning (ML), and deep learning (DL) approaches, but the literature is limited. Indeed, it is narrow because it focuses on predicting only the prices of the few most famous cryptos. In addition, it is scattered because it compares different models on different cryptos inconsistently, and it lacks generality because solutions are overly complex and hard to reproduce in practice. The main goal of this paper is to provide a comparison framework that overcomes these limitations. We use this framework to run extensive experiments where we compare the performances of widely used statistical, ML, and DL approaches in the literature for predicting the price of five popular cryptocurrencies, i.e., XRP, Bitcoin (BTC), Litecoin (LTC), Ethereum (ETH), and Monero (XMR). To the best of our knowledge, we are also the first to propose using the temporal fusion transformer (TFT) on this task. Moreover, we extend our investigation to hybrid models and ensembles to assess whether combining single models boosts prediction accuracy. Our evaluation shows that DL approaches are the best predictors, particularly the LSTM, and this is consistently true across all the cryptos examined. LSTM reaches an average RMSE of 0.0222 and MAE of 0.0173, respectively, 2.7% and 1.7% better than the second-best model. To ensure reproducibility and stimulate future research contribution, we share the dataset and the code of the experiments. Full article
Show Figures

Figure 1

22 pages, 5807 KB  
Article
MinerGuard: A Solution to Detect Browser-Based Cryptocurrency Mining through Machine Learning
by Min-Hao Wu, Yen-Jung Lai, Yan-Ling Hwang, Ting-Cheng Chang and Fu-Hau Hsu
Appl. Sci. 2022, 12(19), 9838; https://doi.org/10.3390/app12199838 - 29 Sep 2022
Cited by 8 | Viewed by 10874
Abstract
Coinhive released its browser-based cryptocurrency mining code in September 2017, and vicious web page writers, called vicious miners hereafter, began to embed mining JavaScript code into their web pages, called mining pages hereafter. As a result, browser users surfing these web pages will [...] Read more.
Coinhive released its browser-based cryptocurrency mining code in September 2017, and vicious web page writers, called vicious miners hereafter, began to embed mining JavaScript code into their web pages, called mining pages hereafter. As a result, browser users surfing these web pages will benefit mine cryptocurrencies unwittingly for the vicious miners using the CPU resources of their devices. The above activity, called Cryptojacking, has become one of the most common threats to web browser users. As mining pages influence the execution efficiency of regular programs and increase the electricity bills of victims, security specialists start to provide methods to block mining pages. Nowadays, using a blocklist to filter out mining scripts is the most common solution to this problem. However, when the number of new mining pages increases quickly, and vicious miners apply obfuscation and encryption to bypass detection, the detection accuracy of blacklist-based or feature-based solutions decreases significantly. This paper proposes a solution, called MinerGuard, to detect mining pages. MinerGuard was designed based on the observation that mining JavaScript code consumes a lot of CPU resources because it needs to execute plenty of computation. MinerGuard does not need to update data used for detection frequently. On the contrary, blacklist-based or feature-based solutions must update their blocklists frequently. Experimental results show that MinerGuard is more accurate than blacklist-based or feature-based solutions in mining page detection. MinerGuard’s detection rate for mining pages is 96%, but MinerBlock, a blacklist-based solution, is 42.85%. Moreover, MinerGuard can detect 0-day mining pages and scripts, but the blacklist-based and feature-based solutions cannot. Full article
(This article belongs to the Special Issue Human-Computer Interactions 2.0)
Show Figures

Figure 1

17 pages, 1815 KB  
Article
Is Bitcoin Still a King? Relationships between Prices, Volatility and Liquidity of Cryptocurrencies during the Pandemic
by Barbara Będowska-Sójka, Agata Kliber and Aleksandra Rutkowska
Entropy 2021, 23(11), 1386; https://doi.org/10.3390/e23111386 - 22 Oct 2021
Cited by 12 | Viewed by 6303
Abstract
We try to establish the commonalities and leadership in the cryptocurrency markets by examining the mutual information and lead-lag relationships between Bitcoin and other cryptocurrencies from January 2019 to June 2021. We examine the transfer entropy between volatility and liquidity of seven highly [...] Read more.
We try to establish the commonalities and leadership in the cryptocurrency markets by examining the mutual information and lead-lag relationships between Bitcoin and other cryptocurrencies from January 2019 to June 2021. We examine the transfer entropy between volatility and liquidity of seven highly capitalized cryptocurrencies in order to determine the potential direction of information flow. We find that cryptocurrencies are strongly interrelated in returns and volatility but less in liquidity. We show that smaller and younger cryptocurrencies (such as Ripple’s XRP or Litecoin) have started to affect the returns of Bitcoin since the beginning of the pandemic. Regarding liquidity, the results of the dynamic time warping algorithm also suggest that the position of Monero has increased. Those outcomes suggest the gradual increase in the role of privacy-oriented cryptocurrencies. Full article
(This article belongs to the Special Issue Entropy-Based Applications in Economics, Finance, and Management)
Show Figures

Figure 1

15 pages, 1173 KB  
Article
On Prices of Privacy Coins and Bitcoin
by Olli-Pekka Hilmola
J. Risk Financial Manag. 2021, 14(8), 361; https://doi.org/10.3390/jrfm14080361 - 6 Aug 2021
Cited by 7 | Viewed by 7382
Abstract
Since the inauguration of cryptocurrencies, Bitcoin has been under pressure from competing tokens. As Bitcoin is a public open ledger blockchain coin, it has its weaknesses in privacy and anonymity. In the recent decade numerous coins have been initiated as privacy coins, which [...] Read more.
Since the inauguration of cryptocurrencies, Bitcoin has been under pressure from competing tokens. As Bitcoin is a public open ledger blockchain coin, it has its weaknesses in privacy and anonymity. In the recent decade numerous coins have been initiated as privacy coins, which try to tackle these weaknesses. This research compares mostly mature privacy coins to Bitcoin, and comparison is made from a price perspective. It seems that Bitcoin is leading privacy coins in price terms, and correlation is typically high and positive. From the earlier crypto market peak of 2017–18, only a very small number of coins are showing positive returns in 2021. It is typical that many privacy coins have lost substantial amounts of their value (ranging 80–90%) or that they do not exist anymore at all. Only Horizen and Monero have shown long-term sustainability in their value; however, their price changes follow that of Bitcoin very closely. The role of privacy coins in the future remains as an open issue. Full article
(This article belongs to the Special Issue FinTech and the Future of Finance)
Show Figures

Figure 1

36 pages, 4845 KB  
Article
Are Cryptocurrencies a Backstop for the Stock Market in a COVID-19-Led Financial Crisis? Evidence from the NARDL Approach
by Ahmed Jeribi, Sangram Keshari Jena and Amine Lahiani
Int. J. Financial Stud. 2021, 9(3), 33; https://doi.org/10.3390/ijfs9030033 - 22 Jun 2021
Cited by 37 | Viewed by 5997
Abstract
The study investigates the safe haven properties and sustainability of the top five cryptocurrencies (Bitcoin, Ethereum, Dash, Monero, and Ripple) and gold for BRICS stock markets during the COVID-19 crisis period from 31 January 2020 to 17 September 2020 in comparison to the [...] Read more.
The study investigates the safe haven properties and sustainability of the top five cryptocurrencies (Bitcoin, Ethereum, Dash, Monero, and Ripple) and gold for BRICS stock markets during the COVID-19 crisis period from 31 January 2020 to 17 September 2020 in comparison to the precrisis period from 1 January 2016 to 30 January 2020, in a nonlinear and asymmetric framework using Nonlinear Autoregressive Distributed Lag (NARDL) methodology. Our results show that the relationship dynamics of stock market and cryptocurrency returns both in the short and long run are changing during the COVID-19 crisis period, which justifies our study using the nonlinear and asymmetric model. As far as a sustainable safe haven is concerned, Dash and Ripple are found to be a safe haven for all the five markets before the pandemic. However, all five cryptocurrencies are found to be a safe haven for three emerging markets, such as Brazil, China, and Russia, during the financial crisis. In a comparative framework, gold is found to be a suitable safe haven only for Brazil and Russia. The results have implications for index fund managers of BRICS markets to include Dash and Ripple in their portfolio as safe haven assets to protect its value during a stock market crisis. Full article
(This article belongs to the Special Issue Financial Markets under Public Emergency)
Show Figures

Figure 1

26 pages, 2019 KB  
Article
Cryptocurrencies Perception Using Wikipedia and Google Trends
by Piotr Stolarski, Włodzimierz Lewoniewski and Witold Abramowicz
Information 2020, 11(4), 234; https://doi.org/10.3390/info11040234 - 24 Apr 2020
Cited by 12 | Viewed by 16112
Abstract
In this research we presented different approaches to investigate the possible relationships between the largest crowd-based knowledge source and the market potential of particular cryptocurrencies. Identification of such relations is crucial because their existence may be used to create a broad spectrum of [...] Read more.
In this research we presented different approaches to investigate the possible relationships between the largest crowd-based knowledge source and the market potential of particular cryptocurrencies. Identification of such relations is crucial because their existence may be used to create a broad spectrum of analyses and reports about cryptocurrency projects and to obtain a comprehensive outlook of the blockchain domain. The activities on the blockchain reach different levels of anonymity which renders them hard objects of studies. In particular, the standard tools used to characterize social trends and variables that describe cryptocurrencies’ situations are unsuitable to be used in the environment that extensively employs cryptographic techniques to hide real users. The employment of Wikipedia to trace crypto assets value need examination because the portal allows gathering of different opinions—content of the articles is edited by a group of people. Consequently, the information can be more attractive and useful for the readers than in case of non-collaborative sources of information. Wikipedia Articles often appears in the premium position of such search engines as Google, Bing, Yahoo and others. One may expect different demand on information about particular cryptocurrency depending on the different events (e.g., sharp fluctuations of price). Wikipedia offers only information about cryptocurrencies that are important from the point of view of language community of the users in Wikipedia. This “filter” helps to better identify those cryptocurrencies that have a significant influence on the regional markets. The models encompass linkages between different variables and properties. In one model cryptocurrency projects are ranked with the means of articles sentiment and quality. In another model, Wikipedia visits are linked to cryptocurrencies’ popularity. Additionally, the interactions between information demand in different Wikipedia language versions are elaborated. They are used to assess the geographical esteem of certain crypto coins. The information about the legal status of cryptocurrency technologies in different states that are offered by Wikipedia is used in another proposed model. It allows assessment of the adoption of cryptocurrencies in a given legislature. Finally, a model is developed that joins Wikipedia articles editions and deletions with the social sentiment towards particular cryptocurrency projects. The mentioned analytical purposes that permit assessment of the popularity of blockchain technologies in different local communities are not the only results of the paper. The models can show which country has the biggest demand on particular cryptocurrencies, such as Bitcoin, Ethereum, Ripple, Bitcoin Cash, Monero, Litecoin, Dogecoin and others. Full article
(This article belongs to the Special Issue Blockchain and Smart Contract Technologies)
Show Figures

Figure 1

12 pages, 233 KB  
Article
Towards the Construction of a Gold Standard Biomedical Corpus for the Romanian Language
by Maria Mitrofan, Verginica Barbu Mititelu and Grigorina Mitrofan
Data 2018, 3(4), 53; https://doi.org/10.3390/data3040053 - 23 Nov 2018
Cited by 6 | Viewed by 4216
Abstract
Gold standard corpora (GSCs) are essential for the supervised training and evaluation of systems that perform natural language processing (NLP) tasks. Currently, most of the resources used in biomedical NLP tasks are mainly in English. Little effort has been reported for other languages [...] Read more.
Gold standard corpora (GSCs) are essential for the supervised training and evaluation of systems that perform natural language processing (NLP) tasks. Currently, most of the resources used in biomedical NLP tasks are mainly in English. Little effort has been reported for other languages including Romanian and, thus, access to such language resources is poor. In this paper, we present the construction of the first morphologically and terminologically annotated biomedical corpus of the Romanian language (MoNERo), meant to serve as a gold standard for biomedical part-of-speech (POS) tagging and biomedical named entity recognition (bioNER). It contains 14,012 tokens distributed in three medical subdomains: cardiology, diabetes and endocrinology, extracted from books, journals and blogposts. In order to automatically annotate the corpus with POS tags, we used a Romanian tag set which has 715 labels, while diseases, anatomy, procedures and chemicals and drugs labels were manually annotated for bioNER with a Cohen Kappa coefficient of 92.8% and revealed the occurrence of 1877 medical named entities. The automatic annotation of the corpus has been manually checked. The corpus is publicly available and can be used to facilitate the development of NLP algorithms for the Romanian language. Full article
(This article belongs to the Special Issue Curative Power of Medical Data)
Show Figures

Figure 1

Back to TopTop