1. Introduction
The Singapore Stone, which is a piece of monolithic sandstone, was initially discovered in 1819.
Figure 1 shows that the initial location of the monolith was at the mouth of the Singapore River. The inscription is still regarded as one of the most crucial and unrelenting epigraphic mysteries in Southeast Asia [
1].
The monument is connected with the maritime trade history of ancient Temasik, which was a fourteenth-century port city [
2,
3]. Temasik participated in broader regional economic and cultural networks [
2,
3]. The memory of the Stone is also kept alive in the local folklore. The legend of the strongman Badang, which appears in the
Sejarah Melayu (
Malay Annals), tells that he dropped a very big rock into the river [
4,
5]. In 1843, the Stone met its irreparable loss, when British engineers caused the explosion to make an opening to Fort Fullerton. This blew the monument to pieces (
Figure 2) [
1,
6]. The artefact that remained is housed in the National Museum of Singapore as a national treasure. It still reminds us of what has been lost and of the intellectual challenge that the inscription poses [
7,
8].
This speculation has lasted for two hundred years, and efforts to decipher the inscription have given more guesses than answers. This made the eroded surface of the script a hindrance to early work. Sir Stamford Raffles and Dr William Bland drew an incorrect conclusion on this matter, suggesting that the script was Pali [
9,
10]. Subsequently, theories of Tamil influence have been proposed (1834), such as those of Captain Peter James Begbies, who preceded the modern view that the writing is that of the Later Kawi school [
11,
12]. Nevertheless, there is a complicated history of this consensus. There is a scholarly dispute on the language used in the inscription, whether it is Old Javanese or Sanskrit. Even the diacritics are remarkably small, and thus hard to read in the Brahmic family of writing [
6,
13]. This interpretive stalemate shows a significant gap in the research. Computational methods may assist scientists to overcome the constraints of the traditional palaeographic analysis [
1,
14].
The physical history of the fragments is also complicated. After the destruction of the stone in 1843, some of the inscribed pieces were saved. They were deposited in the Museum of the Royal Asiatic Society in Calcutta (since renamed the Indian Museum) to be studied by Lieutenant-Colonel James Low [
1,
15,
16]. The works were left for decades in Calcutta. It is also suspected that much of the Stone was cut in Singapore into gravel, which further diluted the corpus [
6]. One piece was returned in 1918, which weighed 80 kg and measured 67 cm, and it is currently known as the Singapore Stone [
6,
17]. The rest of the pieces are believed to be in Calcutta, but their location is uncertain. This also endows the surviving text with even greater significance as it is the primary source of information [
1].
Figure 2.
As reported by [
18]. The three fragments of the Singapore stone.
The script itself is the most detailed contemporary investigation, which assigns the script to Later Kabi (
Figure 3). Lee and Perono Cacciafoco facilitate this classification with a long comparison with the Calcutta Stone, which is an inscription dated 1041 CE [
12,
13]. The two inscriptions have a lot of similar characters, which justifies that they be grouped together. Nevertheless, Lee and Perono Cacciafoco also give an aberration. Singapore Stone contains an abnormally low ratio of diacritics and clusters of characters. This is uncharacteristic of Kawi traditions, and it makes it directly more difficult to decipher [
13,
18]. The fact that the Stone is fractured contributes to this issue. It constrains our possibility to define ancient vowels of writing and the arrangement of syllables in Brahmic scripts. Considering these challenges, we promote computational and data-driven methods, which are capable of suggesting plausible reconstructions, rather than analyses.
A major gap in the literature is covered in this paper, which re-defines epigraphic restoration as a missing-data problem. We symbolise each of the surviving graphemes by a compact categorical ID, and we also make a note of every piece of lost text as a quantifiable number of character spaces. This representation maintains the locality of space of the inscription to be computed. The statistics are very scanty. Due to that reason, reconstruction is local and conservative. Our model is a smoothed first-order Markov transition model (bigram probabilities) that is trained on the observed immediately preceding IDs. We will then only propose reconstructions of short bounded internal gaps (contiguous NaN between observed IDs), restricted by a maximum length K (in a similar case, K = 5). Viterbi bridging is used to calculate the most probable completion of any viable gap. The next step is to estimate ambiguity with posterior marginals of a forward/back pass. We give such results as confidence scores and ordered sequences of alternatives. The proposed fillings can then be replicated by specialists in the form of grapheme exemplars to be inspected. This makes the workflow a process that can be repeated, not an automatic system of decipherment.
The restoration of texts and graphics is not easy. The Singapore Stone is difficult to read since the remaining piece of text is fragmented and unfinished. The initial monolith was lost and disintegrated, and it lacks sufficient philological data on which morphology and syntax can be rebuilt with confidence [
1]. The dating of the writing to the Later Kawi school is supported by most palaeographers. There are, however, a number of atypical characteristics that make phonological and linguistic interpretation difficult. The fact that there are relatively few diacritics in the script is one problem [
13], and the context of the archaeological background is another. The inscription is in a current condition, indicating the disturbance of time and imbalanced conservation. Any reconstruction suggested must then be clear of doubt, and it must not give speculation under the guise of being recovered text.
On a broader level, Southeast Asian palaeographic surveys indicate that Kawi and related Brahmic scripts are characterised by the organisation of grapheme inventories and localised patterned sequences. Even strands of strings, which are heavily damaged, may still have short-range regularities, with no translation necessary. The gaps and editorial interventions are already explicitly coded in the practice of digital humanities. They are treated by editors as objects, as opposed to muting omissions. This is common to epigraphic encoding and TEI/EpiDoc in particular [
19]. The text may also be put into the perspective of missing-data inference [
20] and the contemporary practice of imputation. Such methods put a strong focus on careful guesses and clear statements about uncertainty, such as in the case of gap marks that are regarded as a direct missing value [
21,
22]. Our approach to this concept is probabilistic sequence modelling. We infer the first-order transition structure using the observed adjacencies; then, we compute bounded-gap completions on the basis of dynamic programming. Instead of a single opaque answer, this workflow generates ranked and confidence-tagged restoration hypotheses [
23,
24].
4. Results and Discussion
The independent reproducible pipeline is used to compute all the results in this section using the same position-preserving dataset. The 32 inscription lines are coded in a one-dimensional array of grapheme IDs in the form of categorical representations. The exact number (N character spaces) of segments is multiplied by the exact Not-a-Number (NaN) so that the digital representation does not reduce the original spatial structure and gaps between gaps to a single continuous string. Due to the sparsity of the dataset, which is very intense, the discussion is made with deliberate caution: the workflow is presented to the expert with the hypothesis generator, rather than as an automatic decipherment system.
4.5. Real-World Applications of Concrete Restoration and Sets of Candidates
One of the issues raised in the review was that, in earlier drafts, there is no indication of what the algorithm would actually recommend to epigraphists. We have thus given two complementary perspectives of the output of restoration.
Figure 9 and
Figure 10 present six sample-bounded gap repairs (K = 5) as before/after panels and the known top five candidate sequences with their normalised relative probabilities in the top-five set, respectively.
The panels in
Figure 7 are made by directly building them off the gap report that the pipeline provided. In each of the cases, the left panel displays the entire line in the form of a position map: where cells are observed, they are black; where they are missing, they are light grey, and the gap is indicated by a black line. The retracted distance between the line by the middle panel is maintained. The finished line is displayed in the right panel and the IDs added are highlighted. The confidence of each completion is the average log(p
1/p
2) given over the gap positions.
The six examples in
Figure 8 are the following (all values are directly obtained at α = 0.5 and K = 5). The dimensions of the position map inscription in
Figure 4 are 32 units wide and 219 units long. Two distinct three-slot bounded gaps can be found in Line 1: the completion at gap start = 15 ((6, 6, 6)) has mean completion = 0.243, and the completion at gap start = 22 ((2, 2, 20)) has mean completion = 0.197. There is a two-slot filled gap in Line 5 at gap start = 7 (between 6 and 90) filled in as 6, 22 (mean confidence = 0.399). At gap start = 41 (between 17 and 7) in Line 7, there is a gap with five slots to fill, and this gap is filled with the values 6, 20, 20, 20, 4 (mean confidence = 0.301). In Line 29, there is a two-slot bounded gap with gap start = 8 (between 20 and 2) filled in as 2, 2 (mean confidence = 0.434). Lastly, Line 31 has a four-slot bounded gap at gap start = 31 (between 43 and 20) filled as 20, 20, 20, 20 (mean confidence = 0.236).
For every example, the full line position map (left), zoomed original segment (centre), and zoomed completed segment (right) are shown. The highlighted IDs are those that have been inserted; the mean calculated confidence is the mean of log(p1/p2) across the gap.
These panels are supplemented by
Figure 9, as it exposes candidate diversity. With a given bounded gap, it is possible that there are several possible sequences. The table indicates the five most likely sequences of the same local model and indicates their relative probabilities as part of that set of 5 (the sum of the values represents 1). These are not per-position confidence scores but are the aggregate sequence probabilities, as shown in
Figure 7.
In the case of the Line 29 gap (gap start = 8, gap length = 2, gap boundaries = 20, 2), the best candidate sequence (candidate sequence with highest relative probability) is 2, 2, and the relative probability of this sequence is 0.251. Nevertheless, there are still some competitive alternatives: 20, 2 (0.211), 6, 2 (0.196), 20, 20 (0.177), 20, 6 (0.165). This is among the cases when the tool should be used in the form of a hypothesis enumerator: it proposes the best completion, but also points at close options that a specialist can take into consideration as compared with palaeographic or linguistic evidence.
In the case of the single-slot gap (gap start = 46, bounds 2 and 6) in Line 11, the candidate with the highest probability is ID 20 with a relative probability of 0.410. The next candidates are 2 (0.163), 3 (0.153), 32 (0.148), and 6 (0.127). In cases of single slots, the differentiation between top candidates tends to correspond to greater mean confidence, since only one vacancy will be resolved.
In the case of the Line 7 five-slot gap (gap start = 41, end 17 and 7), the optimal sequence is 6, 20, 20, 20, 4 with a relative probability of 0.228. Other candidates indicate that the uncertainty spreads across the internal positions: 6, 6, 20, 20, 4 (0.212), 6, 6, 6, 20, 4 (0.197), 6, 22, 6, 20, 4 (0.194), and 1, 2, 2, 20, 4 (0.168). This set of rankings makes the ambiguity clear and avoids giving the false impression of a unique completion.
Probabilities are standardised against the top-five set; the fraction of certainty of the model over these gaps is summarised in the mean confidence scores in
Section 4.5.
4.6. Sensitivity, Mislabel Invariance, and Missingness Assumptions
The maximum eligible gap length K and the strength of smoothing α are controlled by two modelling options: K and α. Where there are different K values, the number of repaired sites is altered, but the long lacunae remain un-mended. In this dataset, K = 1 occupied 3 gaps (3 positions), K = 3 occupied 12 gaps (27 positions), and K = 5 occupied 17 gaps (50 positions). Making the footprint of the restoration auditable under the K reporting helps avoid accidentally attempting to repair parts of the structure in a hypothetical global rebuilding.
Smoothing α affects predictive sharpness in sparse transition graphs. Using the same masked-character protocol as
Section 4.3, α = 0.1 yields mean accuracy 0.590, α = 0.5 yields 0.533, α = 1.0 yields 0.472, and α = 2.0 yields 0.441. Stronger smoothing reduces contrast between plausible and implausible transitions and lowers accuracy in this dataset.
Figure 4,
Figure 5,
Figure 6,
Figure 7,
Figure 8,
Figure 9 and
Figure 10 report results for α = 0.5, so that completion outputs and validation statistics are generated from the same fixed-parameter setting.
The Markov process is categorical by nature, with the aim of maintaining the integrity of the model. It does not consider integer numbers (ordinal) or numeric values, but merely makes use of equality of labels and observed transition frequencies among states. This plan cushions against enforcing spurious mathematical designs and determines a semantically neutral depiction of the coded epigraphic data. One of these direct directives is a relabelling invariance test: take any random permutation of the ID/integer mapping, run the pipeline, and decode the predictions back, and the resulting completed sequences should be the same. Independent completions with two distinct random re-labellings were found to give the same bounded gap completions on an inverse mapping showing that numeric neighbourhood effects are not seen as dominating predictions, but as dominated by the transition structure.
Lastly, the epigraphic content that has been lost is seldom missing at random. The patterns of systematic lacunae due to the breakage patterns and edge loss may be observed in the heavy-tailed gap length distribution (
Figure 11) and the fact that the missingness depends heavily on position indices (
Figure 12). In order to preserve model integrity, we repeat the statement that the Markov process is a categorical process: it is not a numeric or ordinal procedure that uses integer numbers, but merely takes equality of labels and observed frequencies of transitions between states. In such a manner, arbitrary mathematical structures are avoided, and the coded epigraphic information is maintained in a semantically neutral representation. The pipeline improves the quality of results by choosing to predict only in scenarios with immediate evidential support. It guarantees the reliability and traceability of its reconstructions by including only short and well-grounded spans, avoiding speculation in larger gaps where context is unavailable. The confidence in generating a verifiable local hypothesis space provides a solid foundation for further work. Any extensions, such as incorporating explicit damage models or image-based evidence, would adhere to this principled structure, ensuring that future developments remain grounded in demonstrably local evidence.
In order to turn the ID-level restorations into the readings available to epigraphists, we restore visualisations of some of those bounded gap completions by once again rendering them as glyph-level restoration overlays (
Figure 13,
Figure 14,
Figure 15,
Figure 16 and
Figure 17). This reconstruction model is defined on the position-preserving encoded matrix definition (IDs are categorical states; missing slots are NaN) on its own. However, as a numeric ID cannot be viewed by a human reader, filled outputs are reconstructed into glyph examples to be observed. Every image in
Figure 13,
Figure 14,
Figure 15,
Figure 16 and
Figure 17 is presented in the form of a ‘before/after’ panel: the former is represented by the reference to the original inscribed piece of the text with a visible lacuna (damage or blank space) and the latter superimposes the suggested restored glyph/s of the prototype at its former location/s. More importantly, these overlays provide an explicit view of the hypothesis generated by the very same conservative rule applied in the rest of the paper (internal gaps only, with a maximum gap length of K = 5). With this glyph-level display, specialists are able to identify the palaeographic viability of the offered completion in its immediate context, without affecting the workflow, and can re-create such a graphical request in the form of the underlying ID predictions and gap reports.
Left: original cut piece of the inscription with a missing area that is localised. Right: the same segment with the model-imputed glyph exemplar(s) in orange at an internal gap (K = 5) according to the position-preserving representation that is encoded.
The algorithm offers a localised completion (bounded NaN run) which is short and adds a visualisation to the completion by overlaying the reconstructed glyph exemplar(s) in orange at the slot(s) which are missing. Lacunae that are long are left unfilled.
Left: the original crop; right: the restored glyph insertion (in orange) by the same Markov-bridge completion rule applied to obtain the quantitative results (bounded gaps only; K 5), thus allowing the plausibility of the plausible ones to be checked by the experts themselves.
Conservative bounded-gap repair is marked as an overlay in the form of an orange line within the inscription area, showing how numbered predictive enumeration overrides are translated into something understandable to a human reader as an enumerative restoration theory, leaving observed strokes around the repair untouched.
The reinstated glyph exemplar(s) (orange) are only implanted at the demarcated point of the gap, and the rest of the fragmentary area is maintained. This illustrates the workflow as an expert-facing hypothesis generator and not a global reconstructive system.
5. Conclusions
The current paper describes a unique and replicable system of interpreting micro-restoration of very tiny inscriptions as a structured variable of absent data, but this is chiefly valuable for epigraphists. This underlying novelty is a position-preserving encoding scheme: each instance of character space is converted into an explicit absent position, thereby ensuring that the transcription leaves the geometry of the original line intact and allows one to ask purposeful and location-dependent restoration queries, which are directly related to the artefact.
In our categorical bigram bridging model, applied to this representation, we offer a deliberately conservative construction mechanism of bridging. It merely utilises observed adjacencies amongst coded IDs and does not attempt to traverse gaps that are not evidentially supported. The K = 5 constraint and the bounded-gap rule are principled methodological safeguards that ensure that the workflow is veridical with respect to the claims it can make with the data it has. In practice, this provides a set of local conjectures, which are well-grounded but do not constitute a complete, systematic reconstruction.
Quantitative assessment supports this locally based approach. The mean top-one accuracy of the model in masked character recovery is 0.533 across 50 repetitions, which is far superior to both mode and frequency-matched random baselines. It demonstrates that the transcription does not exclusively encode global symbol frequency as its meaningful local structure. Particularly in the analysis of expert adoption, the model confidence scores are highly functional: as the model assigns higher confidence to predictions, the empirical accuracy increases, which implies that confidence is a tool that can be effectively used for triage. The system does not suggest a single opaque restoration; it provides a list of ranked options for each gap, allowing epigraphists to take into consideration any external information, including parallel texts and grammatical and palaeographic data, to make their decision.
The same principled design decisions, which also determine the scope of the system, facilitate transparency. The sparsity and non-random missingness of the data, as well as the first-order transition model, are evidence of the fact that the system is not in a position to encode higher-order linguistic regularities, and this points to the actuality of the constraints: this system is an efficient hypothesis generator, but not an autonomous decipherment engine. Finally, performance is grounded on open routine encodings, and this points out the importance of transcription practices.
The structure allows numerous extensions to future operation without compromising its internal verifiability and scientific intent. To begin with, the training corpus should be extended to similar inscriptions, which would enhance transition statistics without breaching the bounded-gap layouts. Moreover, the higher-order or variable-order types of categorical models can be added to be able to learn longer-scale patterns of the locality without being able to trade off interpretability, especially when associated with transparent ablation studies and uncertain reporting. Additionally, evidence in terms of images, like stroke similarity or damage models, may be integrated, and that would perhaps mend the symbolic explanation with the physical properties of artefacts and turn the existing overlay demonstrations into multi-modal validation frameworks in their entirety.
Altogether, this paper provides a generalizable and practical template for computational epigraphy in high-missingness scenarios: preserve positional integrity, operate with conservative models, communicate uncertainty, and provide restorations as ranked, testable hypotheses that can augment—but not replace—professional epigraphical judgement.