Article
Improved Script Identification Algorithm Using Unicode-Based Regular Expression Matching Strategy
Mamtimin Qasim and Wushour Silamu
Although script identification is the first step in many natural language processing and text mining tasks, no open-source script identification algorithm for text is currently available. We therefore analyze the Unicode encoding of each type of sc...
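To make the general idea of matching Unicode ranges with regular expressions concrete, here is a minimal Python sketch. It is not the authors' algorithm: the table `SCRIPT_PATTERNS`, the function `identify_script`, and the code-point ranges are all assumptions made for illustration, and the ranges cover only a few well-known Unicode blocks rather than complete scripts.

```python
import re
from collections import Counter

# Hypothetical, deliberately incomplete table: each entry maps a script name
# to a regular expression over a few Unicode blocks (an assumption for this
# sketch, not an exhaustive definition of the script).
SCRIPT_PATTERNS = {
    # Basic Latin letters plus Latin-1 Supplement and Latin Extended-A/B
    # letters, skipping the multiplication (U+00D7) and division (U+00F7) signs.
    "Latin": re.compile(r"[A-Za-z\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u024F]"),
    # Cyrillic block.
    "Cyrillic": re.compile(r"[\u0400-\u04FF]"),
    # Arabic block (also contains some punctuation and digits).
    "Arabic": re.compile(r"[\u0600-\u06FF]"),
    # CJK Unified Ideographs block.
    "Han": re.compile(r"[\u4E00-\u9FFF]"),
}


def identify_script(text):
    """Return the name of the script with the most matching characters,
    or None if no character falls in any of the listed ranges."""
    counts = Counter()
    for name, pattern in SCRIPT_PATTERNS.items():
        matches = len(pattern.findall(text))
        if matches:
            counts[name] = matches
    if not counts:
        return None
    return counts.most_common(1)[0][0]
```

Calling `identify_script("Hello world")` with this table selects the Latin entry, while a string made up only of digits and ASCII punctuation matches no range and yields `None`. Counting matches per script and taking the majority is one simple design choice; a fuller implementation would need complete range tables and a policy for mixed-script text.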