Predicting Risks of Machine Translations of Public Health Resources by Developing Interpretable Machine Learning Classifiers
Abstract
:1. Introduction
2. Related Work
3. Materials and Methods
3.1. Collection and Selection of WHO Public Health Education Resources
3.2. Statistical Analysis
4. Screening and Classification of Machine Translation Mistakes
5. Classification of Clinically Significant MT Errors
6. Training and Testing of Machine Learning Classifiers
7. Optimisation Techniques
8. Refinement of Automatically Optimised Features
9. Multinominal Naïve Bayes (MNB) Classifiers
10. Results
10.1. Comparison of Performance of Classifiers
10.2. Area under the Curve of Receiver Operator (AUC of ROC), Sensitivity, and Specificity
10.3. Classifier Scalability
11. Discussions
Probabilistic Outputs
12. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Appendix A. Types of Clinically Misleading Machine Translation Errors
Appendix A.1. Conceptual Mistakes
- Example 1
- Original English: Following inhalation of the toxin, symptoms become visible between 1–3 days, with longer onset times for lower levels of intoxication.
- Human: 吸入毒素后,症状在1–3天后可见。中毒程度越低,症状显露所需时间越长。
- Human: After inhaling the toxin, symptoms become visible within 1–3 days. The lower the intoxication, the longer it took for the symptoms to show.
- MT: 吸入毒素后,症状会在1–3天内出现,中毒程度较低时,发作时间较长。
- MT: After inhaling the toxin, symptoms become visible within 1–3 days. The lower the intoxication, the longer the outbreak.
- Example 2
- Original: The different mosquito species live in different habitats—some breed around houses (domestic), others in the jungle (wild), and some in both habitats (semi-domestic).
- Human: 不同的蚊种生活在不同的栖息地—有些在房屋周围(家居环境)繁殖,有些在野外丛林中繁殖,还有些可在两种环境中都繁殖(半家居环境)。
- Human: Different mosquito species live in different habitats—some breed around houses (domestic), others in jungles, some in both environments (semi-domestic).
- MT: 不同种类的蚊子生活在不同的栖息地—有些在房屋周围繁殖(国内),有些在丛林中(野生),有些在两个栖息地(半国内)。
- MT: Different mosquito species live in different habitats—some breed around houses (national), others in jungles and some in both places (semi-national).
- Example 3
- Original: People who inject drugs can take precautions against becoming infected with HIV by using sterile injecting equipment for each injection, and not sharing drug-using equipment and drug solutions.
- Human: 注射吸毒者可以采取措施避免感染艾滋病毒,包括每次注射都使用无菌注射设备,不共享吸毒工具和毒品溶液。
- Human: People who inject drugs can avoid being infected with HIV with preventative measures, such as using sterile injection equipment each time, and not sharing drug equipment and drug solutions.
- MT: 注射吸毒者可以通过每次注射使用无菌注射设备,并且不共用吸毒设备和药物溶液来预防感染艾滋病毒。
- MT: People who inject drugs can avoid being infected with HIV with preventative measures, such as using sterile injection equipment each time, and not sharing drug equipment and medicine solutions.
- Example 4
- Original: For men who have sex with men “event driven” PrEP is also an effective PrEP option. This is taking two pills sex between two and 24 h in before sex: then, a third pill 24 h after the first two pills, and a fourth pill 48 h after the first two pills.
- Human: 对于男性行为者来说,“按需启动”暴露前预防也是一种有效选项。具体就是在发生性行为前2到24小时服用两片;然后,在首次服药后24小时服用第三片,首次服药后48小时服用第四片。
- Human: For men who have sex with men, prevention before exposure based on ‘driven by needs’ model be an effective option. Specifically, this means to take 2 pills 2–24 h before sex; then, take the 3rd pill 24 h after the first pill, and the fourth pill 48 h after the 4th pill.
- MT: 对于与男性发生性关系的男性,“事件驱动”PrEP也是一种有效的PrEP选择。这是在性交前的2到24小时之间服用两粒药;然后,在前两粒药丸后24小时服用第三粒药丸,在前两粒药丸后48小时服用第四粒药丸。
- MT: For men who have sex with men, driven by events PrEP is an effective PrEP option. This is to take 2 pills 2–24 h before sex; then, the 3rd pill 24 h after the first 2 pills and the 4th pill 48 h after the first 2 pills.
- Example 5
- Original: Serological or other testing and culling can also be effective in areas with low prevalence.
- Human: 血清学或其他检测和扑杀措施在低流行率地区也是有效的。
- Human: Serological or other testing and killing can be effective in areas with low prevalence.
- MT: 血清学或其他检测和剔除在流行率低的地区也很有效。
- MT: Serological or other testing and removal can be very effective in areas with low prevalence.
- Example 6
- Original: HSV-1 (Herpes Simplex) is most contagious during an outbreak of symptomatic oral herpes but can also be transmitted when no symptoms are felt or visible.
- Human: 1型单纯疱疹病毒在有症状的口腔疱疹发作期间的传染性最强,但当没有症状或者症状不明显时也可出现传播。
- Human: HSV-1 is most contagious during the outbreak of symptomatic oral herpes but can also be transmitted when no symptoms are shown or less visible.
- MT: HSV-1在有症状的口腔疱疹爆发期间最具传染性,但也可以在没有感觉或可见症状时传播。
- MT: HSV-1 is most contagious during the outbreak (among people) of symptomatic oral herpes but can also be transmitted when no symptoms are shown or less visible.
- Example 7
- Original: The transmission of HIV from an HIV-positive mother to her child during pregnancy, labour, delivery or breastfeeding is called vertical or mother-to-child transmission (MTCT).
- Human: 艾滋病毒阳性母亲在妊娠,生产,娩出或哺乳期间将艾滋病毒传给婴儿称为垂直传播或母婴传播。
- Human: The transmission of HIV from an HIV-positive mother to her child during pregnancy, labour, delivery or breastfeeding is called vertical or mother-to-child transmission (MTCT).
- MT: 艾滋病毒阳性母亲在怀孕、分娩、分娩或哺乳期间将艾滋病毒传播给她的孩子称为垂直传播或母婴传播 (MTCT)。
- MT: The transmission of HIV from an HIV-positive mother to her child during pregnancy, labour, labour or breastfeeding is called vertical or mother-to-child transmission (MTCT).
Appendix A.2. Logical Confusion
- Example 8
- Original: HIV self-testing is a process whereby a person who wants to know his or her HIV status collects a specimen, performs a test, and interprets the test results in private or with someone they trust.
- Human: 当人们希望得知其艾滋病毒感染状况时可以采用自检程序,他们可以私下或与信得过的人一起采集标本,进行检测并解读检测结果。
- Human: (a person wanting to know his HIV status) can privately or with someone they trust collect a specimen, perform a test, and interpret the test results.
- MT: HIV自我检测是一个过程,希望了解自己的HIV状态的人收集样本,进行检测,并私下或与他们信任的人解释检测结果。
- MT: (a person wanting to know his HIV status) can collect a specimen, perform a test, and then privately or explain the test results to people they trust.
- Example 9
- Original:Occasionally humans working or travelling in the forest are bitten by infected mosquitoes and develop yellow fever.
- Human: 在森林中工作或旅行的人偶尔会被受感染的蚊子叮咬并染上黄热病。
- Human: People working or travelling in the forest, occasionally are bitten by infected mosquitoes and develop yellow fever.
- MT:偶尔在森林中工作或旅行的人会被受感染的蚊子叮咬并患上黄热病。
- MT: People who occasionally work or travel in the forest are bitten by infected mosquitoes and develop yellow fever.
- Example 10
- Original: Medical male circumcision reduces the risk of heterosexually acquired HIV infection in men by approximately 50% including in ‘real world’ settings where scale up occurred alongside the increasing coverage of ART with its secondary prevention effect.
- Human: 男性自愿医疗包皮环切术可将男性通过异性性行为方式获得艾滋病毒感染的风险降低50%,包括在“现实世界”环境中。随着抗逆转录病毒疗法及其二级预防效果覆盖面的扩大,这种做法也在增加。
- Human: Medical male circumcision can reduce the risks of men contracting HIV through heterosexual activities by about 50%, including in ‘real world’. With the increasing application of antiretroviral therapy and its secondary prevention effect, this method (medical male circumcision) was on the rise.
- MT: 医学男性包皮环切术可将男性异性性感染HIV感染的风险降低约50%,包括在“现实世界”环境中,随着ART的覆盖范围不断扩大,其二级预防效果不断扩大。
- MT: Medical male circumcision can reduce the risks of men contracting HIV through heterosexual activities by about 50%, including in ‘real world’. With the increasing application of ART, its secondary prevention effect increased as well.
References
- Hershberg, P.I.; Goldfinger, S.E.; Lemon, F.R.; Fessel, W.J. Medical record as index of quality of care. N. Engl. J. Med. 1972, 286, 725–726. [Google Scholar] [PubMed]
- Rosenbloom, S.T.; Denny, J.; Xu, H.; Lorenzi, N.; Stead, W.; Johnson, K.B. Data from clinical notes: A perspective on the tension between structure and flexible documentation. J. Am. Med. Inform. Assoc. 2011, 18, 181–186. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Lysanets, Y.V.; Bieliaieva, O.M. The use of Latin terminology in medical case reports: Quantitative, structural, and thematic analysis. J. Med. Case Rep. 2018, 12, 1–10. [Google Scholar] [CrossRef] [PubMed]
- Marecková, E.; Simon, F.; Cervený, L. Latin as the language of medical terminology: Some remarks on its role and prospects. Swiss Med. Wkly. 2002, 132, 581–857. [Google Scholar] [PubMed]
- Sharma, S. How to Become a Competent Medical Writer? Perspect. Clin. Res. 2010, 1, 33–37. [Google Scholar] [PubMed]
- Khoong, E.C.; Steinbrook, E.; Brown, C.; Fernandez, A. Assessing the Use of Google Translate for Spanish and Chinese Translations of Emergency Department Discharge Instructions. JAMA Intern. Med. 2019, 179, 580. [Google Scholar] [CrossRef] [PubMed]
- Kirchhoff, K.; Turner, A.M.; Axelrod, A.; Saavedra, F. Application of statistical machine translation to public health infor-mation: A feasibility study. J. Am. Med. Inform. Assoc. 2011, 18, 473–478. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Aymerich, J. Using Machine Translation for Fast, Inexpensive, and Accurate Health Information Assimilation and Dissemi-nation: Experiences at the Pan American Health Organization. In Proceedings of the 9th World Congress on Health Information and Libraries; Pan American Health Organization: Salvador da Bahia, Brazil, 2005. [Google Scholar]
- Hasan, A.; Alam, F.; Chowdhury, S.A.; Khan, N. Neural vs Statistical Machine Translation: Revisiting the Bangla-English Language Pair. In 2019 International Conference on Bangla Speech and Language Processing (ICBSLP); IEEE: Sylhet, Bangladesh, 2019; pp. 1–5. [Google Scholar]
- Poibeau, T. Machine Translation; MIT Press: Cambridge, MA, USA, 2017. [Google Scholar]
- Koehn, P. Statistical Machine Translation; Cambridge University Press: Cambridge, UK, 2009. [Google Scholar]
- Koehn, P.; Knowles, R. Six Challenges for Neural Machine Translation. arXiv 2017, arXiv:1706.03872. Available online: https://arxiv.org/abs/1706.03872 (accessed on 3 July 2021).
- Martins, D.B.D.J.; Caseli, H.D.M. Automatic machine translation error identification. Comput. Transl. 2014, 29, 1–24. [Google Scholar] [CrossRef]
- Flesch, R. A new readability yardstick. J. Appl. Psychol. 1948, 32, 221. [Google Scholar] [CrossRef] [PubMed]
- Gunning, R. Technique of Clear Writing; McGraw-Hill: New York, NY, USA, 1952. [Google Scholar]
- Mc Laughlin, G.H. SMOG grading-a new readability formula. J. Read. 1969, 12, 639–646. [Google Scholar]
- Way, A. Machine translation: Where are we at today? In The Bloomsbury Companion to Language Industry Studies; Angelone, E., Massey, G., Ehrens-Berger-Dow, M., Eds.; Bloomsbury Publishing: London, UK, 2018. [Google Scholar]
- Piao, S.; Bianchi, F.; Dayrell, C.; D’Egidio, A.; Rayson, P. Development of the Multilingual Semantic Annotation System. In Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies; Association for Computational Linguistics (ACL): Denver, CO, USA, 2015; pp. 1268–1274. [Google Scholar]
- McCallum, A.; Nigam, K. A comparison of event models for naive bayes text classi-fication. In AAAI-98 Workshop on Learning for Text Categorization; The Association for the Advancement of Artificial Intelligence (AAAI): Palo Alto, CA, USA, 1998; pp. 41–48. [Google Scholar]
- Abbas, M.; Memon, K.A.; Jamali, A.A.; Memon, S.; Ahmed, A. Multinomial Naive Bayes classification model for sentiment analysis. Int. J. Comput. Sci. Netw. Secur. 2019, 19, 62–67. [Google Scholar]
- Revathy, S.; Niranjani, R.; Kanushya, R.J. Health Care Counselling Via Voicebot Using Multinomial Naive Bayes Algorithm. In Proceedings of the 2020 5th International Conference on Communication and Electronics Systems (ICCES), Coinbatore, India, 10–12 June 2020. [Google Scholar]
- Sulieman, L.; Robinson, J.R.; Jackson, G.P. Automating the Classification of Complexity of Medical Decision-Making in Patient-Provider Messaging in a Patient Portal. J. Surg. Res. 2020, 255, 224–232. [Google Scholar] [CrossRef] [PubMed]
- Awwalu, J.; Umar, N.A.; Ibrahim, M.S.; Nonyelum, O.F. A multinomial Naïve Bayes decision support system for COVID-19 detection. FUDMA J. Sci. 2020, 4, 704–711. [Google Scholar] [CrossRef]
Types | Subtypes | Examples (For Full Examples See Appendix A) (Translated to Chinese) |
---|---|---|
Conceptual mistakes (word level) | Related to epidemiology such as disease or virus transmission | English: longer onset times for lower levels of intoxication. Human: it took longer for symptoms to reveal when the intoxication was low. MT: the attacks of symptoms took longer when the intoxication was low. |
English: different mosquito species live in different habitats; some breed around houses (domestic), others in jungles (wild), and some in both habitats (semi-domestic). Human: different mosquito species live in different habitats; some breed around houses (domestic), others in jungles, and some in both environments (semi-domestic). MT: different mosquito species live in different habitats; some breed around houses (national), others in jungles, and some in both places (semi-national). | ||
Related to medical, clinical, and procedural measures | English: serological or other testing; culling can be effective in areas with low prevalence. Human: serological or other testing; killing can be effective in areas with low prevalence. MT: serological or other testing; removal can be effective in areas with low prevalence. | |
Logical Confusion (sentence level) | Related to epidemiology such as disease or virus transmission | English: occasionally, humans working or travelling in the forest are bitten by infected mosquitoes and develop yellow fever. Human: People working or travelling in the forest are occasionally bitten by infected mosquitoes and develop yellow fever. MT: People who occasionally work or travel in the forest are bitten by infected mosquitoes and develop yellow fever. |
Related to medical, clinical, and procedural measures | English: medical male circumcision reduces the risk of heterosexually acquired HIV infection in men by approximately 50%, including in ‘real world’ settings where scale up occurred alongside the increasing coverage of ART with its secondary prevention effect. Human: Medical male circumcision can reduce the risks of men contracting HIV through heterosexual activities by about 50%, including in ‘real world’. With the increasing application of antiretroviral therapy and its secondary prevention effect, this method (medical male circumcision) was on the rise. MT: Medical male circumcision can reduce the risks of men contracting HIV through heterosexual activities by about 50%, including in ‘real world’. With the increasing application of ART, its secondary prevention effect increased as well. | |
English: a person wanting to know his HIV status collects a specimen, performs a test, and interprets the test results in private or with someone they trust. Human: a person wanting to know his HIV status can privately or with someone they trust collect a specimen, perform a test, and interpret the test results. MT: a person wanting to know his HIV status can collect a specimen, perform a test, and then privately or explain the test results to people they trust. |
Test Result Variable | AUC | S.E. | Asymptotic Sig a | Asymptotic 95% CI | |
---|---|---|---|---|---|
Number of difficult sentences | 0.534 | 0.043 | 0.428 | 0.450 | 0.618 |
Average number of characters | 0.474 | 0.045 | 0.566 | 0.386 | 0.563 |
Average number of syllables | 0.500 | 0.045 | 0.992 | 0.411 | 0.588 |
Passive voice | 0.532 | 0.044 | 0.461 | 0.447 | 0.618 |
Sentences that begin with conjunctions | 0.508 | 0.006 | 0.156 | 0.497 | 0.520 |
Optimised Feature Sets | AUC | S.E. | Asymptotic Sig a | Asymptotic 95% Confidence Interval | |
---|---|---|---|---|---|
MLS Optimisation (5) | 0.601 | 0.043 | 0.018 | 0.518 | 0.685 |
Refined CFJO (6) | 0.652 | 0.042 | 0.000 | 0.570 | 0.733 |
Semantic Optimisation (10) | 0.653 | 0.042 | 0.000 | 0.571 | 0.735 |
Enhanced CFJO (8) | 0.672 | 0.040 | 0.000 | 0.593 | 0.752 |
CFJO (22) | 0.778 | 0.034 | 0.000 | 0.711 | 0.846 |
Optimisation Techniques | Training Data | Test Data | |||
---|---|---|---|---|---|
AUC Mean (SD) | AUC (SD) | Accuracy (SD) | Sensitivity (SD) | Specificity (SD) | |
Binary Classifiers using popular readability tools | |||||
Flesch Reading Ease Scores (60) | / | 0.3699 | 0.5000 | 1.000 | 0.0667 |
Gunning Fog Index (12) | / | 0.4263 | 0.5000 | 0.8462 | 0.2000 |
SMOG Index (12) | / | 0.4051 | 0.4643 | 0.9231 | 0.0667 |
MNB with Full, Non-Optimised Feature Sets | |||||
MLS Full Feature Set (20) | 0.5638 (0.139) | 0.5314 (0.0648) | 0.5893 (0.1452) | 0.4615 (0.1565) | 0.7000 (0.1339) |
Semantics Full Feature Set (115) | 0.5709 (0.116) | 0.6538 (0.0475) | 0.6429 (0.1225) | 0.8846 (0.1003) | 0.4333 (0.1448) |
Structure + Semantics Full (135) | 0.5776 (0.187) | 0.6308 (0.0062) | 0.5714 (0.1499) | 0.5769 (0.1551) | 0.5667 (0.1448) |
MNB with Automatically Optimised Feature Sets | |||||
MLS Optimised Feature Set (5) | 0.5385 (0.110) | 0.5256 (0.0289) | 0.4821 (0.0677) | 0.9615 (0.0604) | 0.0667 (0.0555) |
Semantics Optimised Feature Set (10) | 0.5529 (0.042) | 0.7256 (0.0145) | 0.6607 (0.1385) | 0.7692 (0.1322) | 0.5667 (0.1448) |
Combined separately optimised feature (CFSO) (15) (Semantics Optimised 10 + MLS Optimised 5 features) | 0.569 (0.068) | 0.6744 (0.0305) | 0.6250 (0.1392) | 0.7692 (0.1322) | 0.5000 (0.1460) |
MLS-Semantics jointly optimised (CFJO) (22) | 0.7194 (0.088) | 0.7231 (0.0084) | 0.6821 (0.140) | 0.7308 (0.1392) | 0.6333 (0.1408) |
MNB with Refined Optimised Feature Sets with Enhanced Interpretability, Parsimony, Accuracy | |||||
Refine CFJO (6) | 0.6561 (0.0927) | 0.759 (0.0797) | 0.7397 (0.1270) | 0.84 (0.1132) | 0.633 (0.1408) |
Enhanced CFJO (8) | 0.6487 (0.080) | 0.7603 (0.0301) | 0.7679 (0.1190) | 0.8846 (0.1003) | 0.6667 (0.1377) |
AUC Classifier (Feature Numbers): i, j | p | SE | 95% CI of Mean Difference (j–i) | Effect Size dCohen | Effect Size Glass’ Δ | Common Language Effect Size (CLES) | 95% Confidence Interval for dCohen |
CFSO (15) vs. CFJO (22) | <0.0001 | 0.004 | 0.0403–0.0571 | 2.177 | 5.798 | 0.938 | 1.516–2.838 |
CFSO (15) vs. CFJO (8) | <0.0001 | 0.006 | 0.0746–0.0972 | 2.835 | 2.854 | 0.977 | 2.093–3.577 |
CFSO (15) vs. CFJO (6) | <0.0001 | 0.011 | 0.0620–0.1072 | 1.402 | 1.061 | 0.839 | 0.817–1.987 |
CFJO (22) vs. CFJO (8) | <0.0001 | 0.004 | 0.0289–0.0455 | 1.683 | 1.236 | 0.883 | 1.074–2.293 |
CFJO (22) vs. CFJO (6) | =0.0011 | 0.011 | 0.0147–0.0571 | 0.634 | 0.45 | 0.673 | 0.097–1.17 |
CFJO (6) vs. CFJO (8) | =0.9093 | 0.011 | −0.0213–0.0239 | 0.022 | 0.043 | 0.506 | −0.502–0.545 |
Sensitivity Classifier (Feature Numbers) | p | SE | 95% CI of Mean Difference | Effect Size dCohen | Effect Size Glass’ Δ | Common Language Effect Size (CLES) | Confidence Interval for dCohen |
CFSO (15) vs. CFJO (22) | 0.1373 | 0.026 | −0.0892–0.0124 | −0.283 | −0.276 | 0.579 | −0.809–0.244 |
CFSO (15) vs. CFJO (8) | <0.0001 | 0.022 | 0.0715–0.1593 | 0.983 | 1.151 | 0.757 | 0.429–1.538 |
CFSO (15) vs. CFJO (6) | =0.0029 | 0.023 | 0.0247–0.1169 | 0.575 | 0.625 | 0.658 | 0.041–1.11 |
CFJO (22) vs. CFJO (8) | <0.0001 | 0.023 | 0.1084–0.1992 | 1.268 | 1.533 | 0.815 | 0.694–1.842 |
CFJO (22) vs. CFJO (6) | <0.0001 | 0.024 | 0.0617–0.1567 | 0.861 | 0.965 | 0.729 | 0.313–1.408 |
CFJO (8) vs. CFJO (6) | =0.0294 | 0.020 | 0.0045–0.0847 | 0.417 | 0.445 | 0.616 | −0.112–0.947 |
Specificity Classifier (Feature Numbers) | p | SE | 95% CI of Mean Difference | Effect Size dCohen | Effect Size Glass’ Δ | Common Language Effect Size (CLES) | Confidence Interval for dCohen |
CFSO (15) vs. CFJO (22) | <0.0001 | 0.027 | 0.0796–0.1870 | 0.929 | 0.947 | 0.744 | 0.378–1.481 |
CFSO (15) vs. CFJO (8) | <0.0001 | 0.027 | 0.1136–0.2198 | 1.175 | 1.211 | 0.797 | 0.607–1.742 |
CFSO (15) vs. CFJO (6) | <0.0001 | 0.027 | 0.0793–0.1867 | 0.92 | 0.945 | 0.744 | 0.376–1.479 |
CFJO (22) vs. CFJO (8) | =0.2071 | 0.026 | −0.0188–0.0856 | 0.24 | 0.243 | 0.567 | −0.286–0.766 |
CFJO (22) vs. CFJO (6) | =1 | 0.027 | −0.0527–0.0527 | −0.002 | −0.002 | 0.501 | −0.526–0.522 |
CFJO (8) vs. CFJO (6) | =0.2071 | 0.026 | −0.0856–0.0188 | −0.242 | −0.239 | 0.568 | −0.768–0.284 |
Test Result Pair(s) | Mean Difference | Asymptotic 95% Confidence Interval | ||
---|---|---|---|---|
Lower | Upper | p-Value * | ||
Enhanced CFJO vs. MSL Full | 0.2107 | 0.1466 | 0.2748 | 0.004998 |
Enhanced CFJO vs. Semantic Full | 0.09548 | 0.0261 | 0.1649 | 0.005075 |
Enhanced CFJO vs. CFSO | 0.05445 | −0.0317 | 0.1406 | 0.010272 |
Enhanced CFJO vs. CFJO | 0.05321 | 0.0177 | 0.0887 | 0.012907 |
CFSO vs. MSL Full | 0.15625 | 0.1316 | 0.1809 | 0.004998 |
CFSO vs. Semantic Full | 0.04103 | −0.0508 | 0.1329 | 0.045328 |
CFSO vs. CFJO | −0.00123 | −0.1171 | 0.1146 | 0.297107 |
CFJO vs. MSL Full | 0.15748 | 0.0637 | 0.2513 | 0.004922 |
CFJO vs. Semantic Full | 0.042267 | −0.0526 | 0.1372 | 0.092125 |
Semantic Full vs. MSL Full | 0.11522 | 0.0377 | 0.1928 | 0.004998 |
Techniques | MT-Error-Prone Texts | Non-MT Error-Prone Texts | p * |
---|---|---|---|
Mean Probability, SD (n = 30) | Mean Probability, SD (n = 26) | ||
Flesch Reading Ease Scores (60) | 30.9667 (14.51) | 34.8462 (11.29) | 0.097 |
Gunning Fog Index (12) | 14.3423 (2.34) | 14.7067 (3.087) | 0.349 |
SMOG Index (12) | 14.8154 (1.88) | 15.2133 (2.12) | 0.227 |
MLS Full (20 features) | 0.4437 (0.45) | 0.3411 (0.413) | 0.693 |
Semantics Full (115 features) | 0.8423 (0.32) | 0.5645 (0.44) | 0.050 |
MLS + Semantics Full (135) | 0.6078 (0.46) | 0.4441 (0.47) | 0.095 |
MLS optimised (5) | 0.5928 (0.08) | 0.579 (0.075) | 0.749 |
Semantics optimised (10) | 0.6362 (0.18) | 0.5086 (0.17) | 0.004 |
MLS + Semantics separate optimised (CFSO) (15) | 0.6378 (0.18) | 0.5257 (0.18) | 0.026 |
MLS + Semantics jointly optimised (CFJO) (22) | 0.7186 (0.29) | 0.4487 (0.36) | 0.004 |
Refined CFJO (6) | 0.7520 (0.28) | 0.4483 (0.33) | 0.001 |
Enhanced CFJO (8) | 0.7543 (0.27) | 0.4506 (0.33) | 0.001 |
Probability Thresholds | Sensitivity (95% CI) | Specificity (95% CI) | Positive Likelihood Ratio (LR+) (95% CI) | Negative Likelihood Ratio (LR−) (95% CI) |
---|---|---|---|---|
0.2 | 0.9615 (0.888, 1.00) | 0.3333 (0.165, 0.502) | 1.4423 (1.1072, 1.8789) | 0.1154 (0.0158, 0.8419) |
0.4 | 0.8846 (0.762, 1.00) | 0.4333 (0.256, 0.611) | 1.5619 (1.1086, 2.1984) | 0.2663 (0.0851, 0.8328) |
0.5 | 0.8846 (0.762, 1.00) | 0.6667 (0.498, 0.8354) | 2.6539 (1.570, 4.4851) | 0.17308 (0.058, 0.517) |
0.6 | 0.8077 (0.656, 0.959) | 0.6667 (0.498, 0.8354) | 2.4231 (1.4125, 4.1568) | 0.2885 (0.126, 0.656) |
0.8 | 0.5385 (0.347, 0.730) | 0.80 (0.657, 0.943) | 2.69 (1.210, 5.988) | 0.577 (0.367, 0.907) |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Xie, W.; Ji, M.; Huang, R.; Hao, T.; Chow, C.-Y. Predicting Risks of Machine Translations of Public Health Resources by Developing Interpretable Machine Learning Classifiers. Int. J. Environ. Res. Public Health 2021, 18, 8789. https://doi.org/10.3390/ijerph18168789
Xie W, Ji M, Huang R, Hao T, Chow C-Y. Predicting Risks of Machine Translations of Public Health Resources by Developing Interpretable Machine Learning Classifiers. International Journal of Environmental Research and Public Health. 2021; 18(16):8789. https://doi.org/10.3390/ijerph18168789
Chicago/Turabian StyleXie, Wenxiu, Meng Ji, Riliu Huang, Tianyong Hao, and Chi-Yin Chow. 2021. "Predicting Risks of Machine Translations of Public Health Resources by Developing Interpretable Machine Learning Classifiers" International Journal of Environmental Research and Public Health 18, no. 16: 8789. https://doi.org/10.3390/ijerph18168789
APA StyleXie, W., Ji, M., Huang, R., Hao, T., & Chow, C.-Y. (2021). Predicting Risks of Machine Translations of Public Health Resources by Developing Interpretable Machine Learning Classifiers. International Journal of Environmental Research and Public Health, 18(16), 8789. https://doi.org/10.3390/ijerph18168789