Next Article in Journal
Spectral Map-Based Tool for Backhaul Rural Network Optimization and Indirect Estimation of Demographic Behaviors
Next Article in Special Issue
A Design of CGK-Based Granular Model Using Hierarchical Structure
Previous Article in Journal
The Digital Revolution in the Urban Water Cycle and Its Ethical–Political Implications: A Critical Perspective
Previous Article in Special Issue
Slope Stability Classification under Seismic Conditions Using Several Tree-Based Intelligent Techniques
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Development of Open-Assistant Environment for Integrated Operation of 3D Bridge Model and Engineering Document Information

1
Department of Civil, Environmental and Architectural Engineering, University of Colorado, Boulder, CO 80309, USA
2
Research Institute for Safety Performance, Korea Authority of Land & Infrastructure Safety, Jinju 52856, Korea
3
Taesung SNI Singapore Branch, Singapore 208652, Singapore
4
School of Civil, Environmental and Architectural Engineering, Korea University, Seoul 02841, Korea
*
Author to whom correspondence should be addressed.
Appl. Sci. 2022, 12(5), 2510; https://doi.org/10.3390/app12052510
Submission received: 21 December 2021 / Revised: 22 February 2022 / Accepted: 23 February 2022 / Published: 28 February 2022
(This article belongs to the Special Issue Novel Hybrid Intelligence Techniques in Engineering)

Abstract

:
This study proposes a method for assistant environments to integrate 3D bridge model information and engineering document fragments. The engineering document content varies depending on the process. Therefore, we accept a loose coupling concept to support the independence of each information set instead of using a specific data model for effective integration. The engineering document is translated into an Extensible Markup Language (XML)-based structured format based on the explicit and apparent semantic structure of the document. An extended industry foundation classes (IFC) schema is proposed to manage the bridge information model, as well as document fragments. An information document (iMapDoc) is proposed to manage interim data to connect a 3D digital model, an IFC model, and engineering document fragments. Document fragments on a specific component in the 3D bridge model are retrieved to validate the developed integrated assistant module.

1. Introduction

A certain level of knowledge is required to understand and use construction information presented in the form of engineering documents, such as structural calculation documents. Therefore, an environment that can provide information regarding various types, such as documents or 3D digital models, depending on the intended purpose must be established; this is because completing a construction project is achieved through the collaboration of participants with various knowledge backgrounds. Tatum [1] emphasized the role of a three-dimensional (3D) model-based environment in promoting efficiency in the communication of all architect, engineering, and construction (AEC) project participants, and we agree with his opinion. Building information modeling (BIM) is a specialized application field of 3D model-based integration from an information point of view; furthermore, it is expected to be one of the most important applications for the integrated operation of BIM and document-type information in the AEC field. Data in BIM and documents must be managed, as well as linked or aligned, appropriately to achieve an effective integration of BIM and engineering documents in a system or environment.
The information of model objects is managed internally by the tool itself in the case of closed BIM, which generates a model in a proprietary format using BIM authoring tools. Access to model data is possible within the software or within an application programming interface (API) scope. End-users in the practical field often base their preference on closed BIM from the perspective of information management because model visualization and object generation/editing are performed within one platform [2]. However, this method may cause reliability issues, owing to restriction to full data access, platform dependency, and non-guaranteed transparency of internal processes [3]. In open BIM represented by industry foundation classes (IFC), access to all model object information and modification is enabled [4] according to the standard data access interface (SDAI) standardized by the International Organization for Standardization (ISO) [5]. This method, however, requires advanced programming knowledge to control the IFC physical file (IPF) directly for modifying geometries, and there are instability issues in the process of importing the IPF from the BIM tools.
No standardized method is available for data access, unlike in BIM, because the content included in the construction engineering document is in a denormalized form; hence, various relevant studies are being conducted. Marking-up on contents can be interpreted by a machine, although semi-automatic or manual preprocessing is required for generating mark-up [6]. BIM-based automated compliance checking (ACC), the core task of which is to identify the semantic meaning of codes or regulations [3], proceeds in a similar manner to that of BIM construction document connection. To identify the semantic meaning from documents, specific rules were assigned to the plain documents to extract the content [7,8], or an ontology model was used [9,10,11]. Recently, research that extracts meaning from regulations through natural language processing or machine learning has been actively conducted [12,13,14]. These studies focus on the “latent semantic structure” categorized by Wang et al. [15]. The connection between BIM and regulation focuses on mapping the attributes extracted from each model (BIM and document), rather than document fragments or elements from the ACC perspective [16,17,18]. This results in relatively high costs for extracting the required information, as well as inefficiency if the documents are used as non-geometric reference information instead of a regulatory review. If engineering documents are used as references, then the approach is effective when the “explicit or apparent semantic structure” is used to reconstruct construction documents [19,20,21].
Choi et al. [22] has performed BIM-document integration by adding the necessary rules to the IFC entities or properties. Opitz et al. [23] linked document contents stored in a repository with an IFC model via separate link elements. As summarized in Table 1, however, previous research mainly focuses on the perspective of document information extraction, and so do not consider the information connection considering the IFC schema or do not pay much attention to the relationship with the 3D object.
Herein, we studied methods that can link engineering documents as reference information with the bridge information model, and subsequently developed an integrated open-assistant environment. Furthermore, we developed a structuralizing method using the explicit and apparent semantic structure of unstructured plain document contents by improving the work of Kim et al. [20]. We adopted the extended IFC schema proposed by Park et al. [24] for bridge information modeling and described how to manage document information based on IFC. An Autodesk Revit-based add-in module was developed to generate and manage a bridge information model using the adopted IFC schema. An interim mapping document (iMapDoc) generated during the process can serve as an excellent bridge for a seamless interface between the closed and open BIM, as well as providing relevant document fragments for each model component, even when the 3D model objects are modified.

2. Document Analysis and Translation to Structured Format

2.1. Translation Process from Unstructured Engineering Document to XML Document

An engineering document saved as a plain text file is used as an input file to eliminate errors that may occur when recognizing letters from visualized documents. Figure 1 illustrates the entire process of generating a structured Extensible Markup Language (XML) document from a text document referred to by Kim et al. [20]. As depicted in Figure 1, the input engineering document is translated into an XML document through three main steps, according to the subtitle structure. The first step in performing translation to the structured XML document is to store each sentence of the contents sequentially by classifying strings for the types of heading symbols, headings, subtitles, content, and references into a temporary table based on the engineering document model. The types of heading symbols are used to identify the hierarchy of the contents. The temporary table is rearranged after identifying the sentence structure using the existing data stored in the temporary table. The hierarchical information of the contents is identified using the rearranged data connected to the types of heading symbols and the tree structure of the document. Finally, the XML file is generated using the information saved in the temporary table and hierarchical information.

2.2. Content Analysis of Engineering Documents

We defined several notations to describe the content of the engineering documents efficiently, as follows:
  • The string S = s 1 , s 2 , , s n represents a set of characters with a finite sequence; here, s k Ψ , 1 k n , and Ψ is the set of all the characters, including space, in a given document;
  • In A : : = B , the symbol ‘ : : = ’ implies that A can be expressed as B;
  • The symbol ‘|’ represents ‘or’.
The text information comprising an engineering document can be separated using finite string sets with sequences; the string set S i of the ith row can be expressed as Equation (1), referring to Kim et al. [20].
S i : : = H i | C i | H i C i | H i R i | C i R i | H i C i R i
where H i is a string set for the subtitle, H i = s 1 , s 2 , , s l ; C i is a string set for the document content, C i = s l + 1 , s l + 2 , , s m , R i ( = s m + 1 , s m + 2 , , s n ) represents the string set for reference, and 0 l m n .
Figure 2 depicts the content analysis algorithm for extracting the document structure based on Equation (1). More definitions and processes are available in the previous research by Kim et al. [20].

2.3. Identification of Bullet-Form Text Strings

To identify the syntax’s meaning, which comprises the text string of the engineering documents, the temporary table constructed through the component extraction algorithm of the document is used, as described in Section 2.2. T S i , the string set of a random ith row, can be expressed as shown in Equation (2).
T S i = H S I D i , h s i , h c i , C i , R i
where H S I D i is the unique group number including the heading h s i ; h c i is the string for the title except the heading symbol; and C i and R i are text strings for the content and reference, respectively, as defined in Equation (1). We regarded the ith row to i + n th row of the temporary table contents, when they satisfy all conditional expressions shown in Equation (3a), as the bullet-form text strings. Here, n = 1 , 2 , 3 , .
H S I D i = H S I D i + n
h s i + α B S d ( α = 0 , 1 , 2 , n )
C i = C i + n =
In Equation (3b), B S d is the set of heading symbol groups with the depth d, where d is a natural number exceeding 1, as defined by users. In Equation (3c), ∅ means the set with no elements and ∧ logical conjunction or meeting in a lattice. The identified bullet-form text strings according to Equation (3a) are assigned to the i 1 th document content (C) using Equation (4); subsequently, the contents in the temporary table are rearranged.
C n e w i 1 = C o l d i 1 + j = 1 n n l + h s j + h c j
where C n e w i 1 is the text string for the newly updated i 1 th document content, C o l d i 1 is the string text for the i 1 th document content before rearranging the content, and n l is the character that represents a new line.

2.4. Identification of Hierarchical Structure of Subtitles

The problem of translating unstructured document contents into a tree-shaped hierarchical structure can be described by estimating the depth of the headings of the corresponding contents. The hierarchical information of the headings is identified by comparing the number of heading symbol groups. We used the results from a previous study that provided a generalized solution to this problem [20]. Using the algorithm described above, the unstructured document content was regenerated into a structured document format (Figure 3) using the developed translator, as shown in Figure 4. The tags of the XML element indicate the content items describing the function of the sentence, and the heading symbols identified for hierarchical classification are expressed as the header attributes of the element. If the element should refer to other documents or codes, the references appear in the reference attribute of the element. The text data are expressed as parsed character data (PCDATA) of XML. The core functions of the translator shown in Figure 3 are combined into the integrated module to manage the bridge and document model, which will be explained later.

2.5. Performance Evaluation of Document Translation Module

We used precision and recall, which are widely used in the information retrieval field for a performance evaluation of the component extraction algorithm of a document. Precision and recall use the number of true positive (TP), true negative (TN), false positive (FP), and false negative (FN) values; TP means that the extracted title sentence is correct, TN means that the extracted content sentence is correctly recognized, and FP means that the algorithm misrecognizes a sentence as a title. FN means that the title sentence is not recognized as the title sentence. The equations used to measure the precision and recall are as follows.
P r e c i s i o n = T P T P + F P
R e c a l l = T P T P + F N
The hierarchies recognized by the proposed algorithm in this study are relative classification among heading symbols; the recognized results of the hierarchical classification of the preceding items affect the following items. Therefore, the module performance of this part was checked following these equations:
G A H = C G T P
P A H = C P T P
where G A H means ‘generalized accuracy for hierarchy labeler’ and P A H means ‘precise accuracy for hierarchy labeler’. C G is the number of results from which relative hierarchical classification was performed correctly among TPs, and C P is the number of results from which absolute hierarchical classification was performed correctly among TPs. Therefore, G A H and P A H represent ratio values corresponding to C G and C P , respectively.
The proposed algorithm performance, including the content extracting and hierarchical classification processes, was evaluated as the following equations.
G A A = C G T P + F N
P A A = C P T P + F N
where G A A means ‘generalized accuracy for application module’ and P A A means ‘precise accuracy for application module’.
Figure 5 shows the results of applying Equations (5)–(10) to 20 bridge engineering documents with an average number of sentences of 4814 and the number of title sentences to be extracted as 433. The lowest and mean values of PAA are 97.36% and 99.47%, respectively, so it can be judged that the performance as a content item extraction algorithm for BIM-based integration is sufficient.

3. IFC-Based Bridge Information Modeling with Document Metadata

3.1. IFC Schema for Bridge Model and Document Information

The most recent official IFC released by buildingSMART International (bSI) is version IFC4.0.2.1 [25]. IFC4.0.2.1 has opened up the possibility of future extensions for civil infrastructures (IfcCivilElement), although they will be removed in a future version. The unofficial version currently being developed (IFC4.3RC4) includes alignments (IfcAlignment), roads (IfcRoad), railways (IfcRailway), ports (IfcMarineFacility), and bridges (IfcBridge). Regarding the bridge structure, IFC4.3RC3 includes the IfcBridge entity in order to represent the spatial information of the bridge as a subtype of IfcFacility. IfcBridge has an enumeration type to embody a spatial function for the subspace of the bridge [26]. More specific spatial components constituting the bridge structure can be managed using the IfcBridgePartTypeEnum type of IfcFacilityPart. The representation of the physical element of the bridge can be used in the subtypes of IfcBuiltElement.
We used the schema developed by Park et al. [24] instead of IFC4.0.2.1 or IFC4.3RC4 for the following reasons:
  • IFC4.3RC4 is a schema under development that has not yet been officially released. Therefore, current BIM authoring tools, such as Autodesk Revit, cannot handle the information generated by IFC4.3RC4;
  • Information on the bridge structure components covered by IFC4.3RC4 is limited; it does not define bridge-specific and bridge-related attributes that should be treated as entity and attribute level.
IfcBridgeAddMeshfree, proposed by Park et al. [24], extends additional entities focusing on the bridge structure and bridge components (see Figure 6). The detailed elements of bridges regarded as enumeration types in IFC4.3RC4 are also defined as entities.
In the IfcBridgeAddMeshfree schema, IfcBridge is used to manage the spatial information of the bridge structure itself, IfcBridgeSpan is used to segregate the spatial information based on the bridge’s along the road, and IfcBridgeSpacePart is used for the transverse direction of the bridge. In order to represent the physical object of the bridge, it was categorized into girder part, slab part, abutment and pier part, and detailed member component part. Each of these contain enumeration types for detailed types or functions.
External document-related information can be managed through the IfcDocumentInformation entity in the IFC schema. The IfcDocumentInformation can capture metadata, such as the document name, document purpose, and/or revision information of an external document, but not the document content. We linked each content fragment of a document corresponding to a model component rather than linking an entire document for an entire 3D model because IfcDocumentInformation can be associated with all IFC objects through the IfcRelAssociatesDocument entity. The implementation of IfcDocumentInformation-related information will be described later.

3.2. Assistant Module for Integrated Management of Bridge Model and Document Information

We developed a module to manage 3D model information, reflect the extended IFC-based bridge, and enable information retrieval for engineering documents related to bridge segments. The module was based on the Revit API provided by Autodesk [27], a representative BIM authoring tool. This enabled the integration of the extended IFC and engineering document retrieval in the Revit environment. We used the original functions provided by Revit to generate a 3D geometric model (Figure 7a). We modeled the components using general or building elements corresponding to bridge components referred by the concept of Park et al.’s [2] previous research for bridge components, such as bridge piers, that Revit cannot provide. As a note, to connect with the extended IFC entities for the bridge structure, each component should be created as one object. As a preliminary procedure, the user should directly generate spatial objects suitable for bridge and bridge components to apply IFC entities (see Figure 7b). This module suggests an appropriate IFC entity by parsing the EXPRESS files of the extended IFC schema for the spatial object, as depicted in Figure 7c. All 3D physical objects of the bridge collected automatically through the Revit API can be mapped with an appropriate IFC entity using the interface shown in Figure 7d. The appropriate IFC entity can be suggested through the basic information included in the object, such as the object’s family name and description. If the object information has a specific code or words included in the product breakdown structure (PBS) document developed in this study, then the physical object can be mapped precisely with the corresponding IFC entity using this PBS document. The PBS document comprises ab element code (value attribute), name (label attribute), IFC entity of the extended IFC-based bridge schema (IfcEntityName attribute), the IFC4 entity (Ifc4EntityName attribute), and the description (description attribute). Figure 8 shows a part of the PBS document developed in this study. In this module, the model object can be connected to not only the IFC schema entities developed for bridges, but also to the IFC4 entities, such that other commercial BIM authoring tools can utilize the IFC data generated through this module. Figure 9 shows a part of the iMapDoc containing essential information for the conversion to an extended IFC-based IPF, as well as for connecting a 3D object, IFC, and document fragments. We have designed the iMapDoc based on XML that can generate a relationship between bridge components, as well as connect attributes on the corresponding components. Various tested computer libraries related to the use of XML enable us to skip the verification process of the designed XML schema (iMapDoc).
The Project element in line 2 of Figure 9 stores the entire project-related information included in Revit, and is an element that is mapped with the IfcProject entity and its attributes of the IFC schema. The child elements of the StructuralElement in line 3 are mapped with the physical/spatial entities of the IFC. We used the IfcSite entity as the topmost element of all of the spatial elements. The spatial objects generated by the user, shown in Figure 7b, are arranged as IfcSite’s sub-entities. The spatial element representing the entire bridge itself is mapped to IfcBridge as an extended schema entity and IfcBuildingStorey as an IFC4 entity (line 7 of Figure 9). In this case, the IfcBuilding entity can be substituted for IfcBuildingStorey. Since no entities support bridge structures in IFC4, it is not essential to consider which entities of the IFC4 are used in this study. The entity just needs to be a spatial entity. The spatial objects constituting the entire bridge, such as the bridge section and the upper and lower structures, are mapped with each IFC entity, as shown in lines 11–17 of Figure 9. Line 19 of Figure 9 shows one of the physical elements of the bridge structure, i.e., Concrete_Pier, which includes not only the IFC entities (IfcEntityName, Ifc4EntityName), but also the object number (OidInSWDB) managed in Revit and the classification number (PBSCode) defined in the PBS. These data can serve as insights for linking information between the Revit objects and the IFC entity software independently, as well as between the IFC entities and engineering documents described in the following sections. The 3D geometries generated in the Revit are represented in the form of boundary representations (B-rep) in the IPF in this study. The vertex and edge information of the B-rep are composed of a separate file, as shown in Figure 10. The iMapDoc can manage this information through the identifier shown in line 24 of Figure 9. The CompNo attribute points to the ID of the vertex and triangle data, as shown in Figure 10. The iMapDoc can be converted to the IPF using subtypes of the IfcRelationship entity to connect “spatial object–spatial object,” “spatial object–physical object,” and “physical object–physical object” while sequentially reading elements of the iMapDoc. Property sets (lines 26–33 of Figure 9) in each element are connected to the parent object using the IfcRelDefinesByProperties entity in the IFC.
The structure of the document contents can be represented via the tree view, list view, or plain view features provided by the .NET framework, since the engineering document has been translated to the XML format (see Figure 7e). The document content query process is discussed in the next section.

4. Experimental Verification via Retrieval of Document Fragments Related to the 3D Model Object

4.1. Process of the Experimental Verification for Connected Document Fragments and 3D Model Object

The contents in the XML-formatted structured engineering document can be retrieved using the information included in the selected object of the bridge components via the interface shown in Figure 7a or Figure 7b. XML element tags, attributes of the element, and text data (PCDATA of an XML element) are used as keywords to match a specific segment in the document fragments. The query, response, and management of the XML data process are performed by combining the document object model (DOM) to treat XML data for language independently adopted as standard by W3C [28], regular expression to search string patterns introduced by Kleene [29], and XPath defined by W3C as a standard XML data query language [30]. Figure 11 shows the conceptual process for retrieving document fragments from structured engineering documents. The entire document retrieval process occurs in the Revit environment, as shown in Figure 11. Information on the 3D model can use both the iMapDoc developed according to the process described in Section 3.2 and the data included in the IPF. Figure 12 illustrates the process that occurs during the Query process shown in Figure 11. Exact word matching is prioritized; however, if an exact matching element does not exist, then the module matches semantically similar words using the Dictionary for synonyms shown in Figure 11. The Dictionary for synonyms database was developed in this study to increase the accuracy of word matching; it comprises the attributes of ID, Korean Word, English Word, Abbreviation, Symbol, and Synonym.

4.2. Connection Review of the Retrieved Document Fragments and the 3D Model Objects

Figure 13 shows a part of the data of the retrieved structured engineering document related to the Concrete_Pier object under Pier_2-4. These data are presented in Figure 7e, where the results are shown in a list view (➀ of Figure 14). Pier_2-4, as well as its parent and children elements, can be used to retrieve Concrete_Pier-related elements and information because the object selected in Figure 7b is synchronized with iMapDoc (see line 19 of Figure 9 and ➁ of Figure 14). The Dictionary for synonyms database was also used to retrieve the related content. The results selected by the user among the retrieved document contents are integrated with the IPF. The connection information between the 3D object and document content is stored in the Doc element in iMapDoc, as shown in line 23 of Figure 9 (➂ of Figure 14). The connection information indicates the location of the related document content translated into XML format according to the method described in Section 2 using XPath; line 23 of Figure 9 represents . / p i e r _ d e s i g n [ @ h e a d e r = 9 . ] . The Doc element can be added to all children elements of StructureElements of iMapDoc. The selected object from the 3D model or iMapDoc can be connected to the IFC entity mutually through the element name and the GlobalId attribute of the iMapDoc. Figure 15 shows a part of the IPF generated using a developed converting module including the Pier_2-4 object. The Pier_2-4 object was implemented as the IfcBridgeSpacePart entity, and its attributes are consistent with the attributes of GlobalId and Data in line 17 of Figure 9. Concrete_Pier was implemented as the IfcBridgePier entity, as shown in #2976952 in Figure 15, corresponding to line 19 of Figure 9 (➃ of Figure 14). The elements of iMapDoc and the entities of IPF share the entity name and global ID; the IFC and Revit are connected through the information of OidInSWDB and PBSCode stored in iMapDoc. The Representation attribute representing the shape information of the IfcBridgePier entity uses a subtype of the IfcProductRepresentation entity, and the IfcProductDefinitionShape entity is used for the representation, as shown in #1273678. The actual data are implemented using the IfcShapeRepresentation entity (#1274058), which utilizes the information in line 24 of Figure 9 (➄ of Figure 14). The IfcBridgeSpacePart (#2971755) and IfcBridgePier (#2976952) were connected through the IfcRelContainedInSpatialStructure entity (#1268533) since they are spatial and physical entities defined by the extended IFC, respectively. The pier_design-related information (line 23 of Figure 9) of the engineering document connected to the object of Concrete_Pier (line 19 of Figure 9) represents the IfcDocumentInformation entity (shown in #4852759 in Figure 15). Among the 17 attributes of the IfcDocumentInformation entity in the IFC4 schema, we generated values for attributes Identification, Name, Description, Location, ElectronicFormat, and Status automatically based on iMapDoc. The Identification attribute is used to identify documents, and the value data (pier_design-pSMyBLhGB06HLAvQv7x8RA) combined with the name (pier_design) of the connected document fragment and the GlobalId attribute data (pSMyBLhGB06HLAvQv7x8RA) of the parent element (Concrete_Pier) are used (➅ of Figure 14). The document fragment name (pier_design) is used for the Name attribute, and the file name of the XML-formatted construction document (Jaenaechon.xml) is represented in the Description attribute. The Location attribute specifies the location of the document in the form of a uniform resource identifier (URI); it ( J a e n a e c h o n / p i e r _ d e s i g n [ @ h e a d e r = 9 . ] ) is represented as using XPath, combining the path of the root element (Jaenaechon) of the construction document and the attribute data in the Doc element in Figure 9 to specify a concrete path for querying the document fragment. We specify application/xml according to the XML Multipurpose Internet Mail Extension (MIME-type) as the attribute data of ElectronicFormat for the type of media. The Status attribute indicates the document’s status represented by the IfcDocumentStatusEnum type, and we designated the .FINAL. value by default.
This example verifies that 3D models, engineering documents, and IFC can be run together using the proposed integrated approach. In particular, the document files were not deliberately reconstructed and were used as they were created in the bridge design work. Furthermore, we can map information objects from different sets of information using the names of the sub-bridges and components. The same names for subsystems and bridge components are typically used in bridge design drawings and documents. Therefore, it is believed that this method can facilitate the establishment of an integrated information environment for 3D bridge models and engineering documents.

5. Conclusions

Providing appropriate information in a valid and accessible format to construction participants with various knowledge backgrounds is the most important consideration for successful construction and management activities. Research pertaining to the efficient interconnectivity between 3D models and engineering records stored in documents is still in its infancy, whereas research for providing pertinent information based on the visibility of 3D model applications has been actively conducted in recent decades. As one of the methods to effectively deliver information used in the construction and management of bridge structures to users, we focused on integrating the bridge information model and engineering documents, which serve as references throughout the lifecycle. To achieve this, we first proposed a structuralizing method to transform unstructured plain text-typed engineering documents to XML documents by classifying titles and contents, defining hierarchies, and reconstructing them using explicit and apparent semantic structures. Second, an extended IFC schema that can handle bridge structure information was adopted, and IFC entities that can connect IFC and document information were selected. Finally, a Revit-based add-in module was developed to assist in the integrated operation of bridge information models and engineering documents. The module includes the functions of generating IFC spatial objects, placing physical objects into spatial objects, and interconnecting XML document contents related to model objects. Furthermore, it generates an extended IFC-based IPF that retains hierarchical object relationships and document connection information.
The main contribution of this study was the proposal of a new approach toward achieving an interconnected and integrated operation of the BIM authoring tool–IFC–engineering document using an interim information document named iMapDoc beyond the IFC– engineering document linkage. Each specialized engineering process evolves continuously as new engineering techniques and design philosophies are developed, which can naturally support the process conducted in each independent engineering domain. We expect the proposed process in this study to serve as a reference for future studies as follows:
  • Document content management using the IFC: Although the IFC contains most of the information that needs to be dealt with during the entire lifecycle of a facility, few studies suggest a technical process to control the document fragments. This study explained how IFC manages document fragments with examples;
  • Smart infrastructure: The successful adaptation of BIM for buildings promotes the growth of BIM for infrastructure for operating smart infrastructure. Interoperability, as well as data mapping between physical and digital models, are considered to be some of the essential keys of successful BIM for infrastructure [31,32]. The integrated operation process of the BIM authoring tool–IFC–engineering document proposed in this study can be a good reference.
Several issues should be addressed to implement the proposed integrated approach more comprehensively in practical working environments. The proposed approach uses the names of instances and type objects (e.g., types of cross-section, material, and connection) to map different information sets. Therefore, a systematic naming rule should be established to identify bridge components and object types based on the consensus of the project participants. Another drawback is that the developed module cannot process contents in figures or tables. However, the text information contained in the tables can be handled using a specific element node in a structured document. This indicates that the figures and tables can be correctly provided with subtitles.

Author Contributions

Conceptualization, S.I.P., B.-G.K. and G.Z.; methodology, S.I.P.; software, S.I.P. and B.-G.K.; validation, S.I.P. and W.G.; formal analysis, S.I.P.; investigation, S.I.P. and B.-G.K.; resources, S.I.P. and B.-G.K.; data curation, S.I.P. and W.G.; writing—original draft preparation, S.I.P. and B.-G.K.; writing—review and editing, S.I.P. and G.Z.; visualization, S.I.P. and B.-G.K.; supervision, G.Z.; project administration, G.Z.; funding acquisition, G.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This work was partially supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT; No. NRF-2021R1A5A1032433).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Tatum, C.B. Integration: Emerging management challenge. J. Manag. Eng. 1990, 6, 47–58. [Google Scholar] [CrossRef]
  2. Park, S.I.; Park, J.; Kim, B.G.; Lee, S.H. Improving applicability for information model of an IFC-based steel bridge in the design phase using functional meanings of bridge components. Appl. Sci. 2018, 8, 2531. [Google Scholar] [CrossRef] [Green Version]
  3. Preidel, C.; Borrmann, A. BIM-based code compliance checking. In Building Information Modeling—Technology Foundations and Industry Practice; Borrmann, A., König, M., Koch, C., Beetz, J., Eds.; Springer International Publishing AG: Cham, Switzerland, 2018; pp. 367–381. [Google Scholar]
  4. Lin, J.R.; Hu, Z.Z.; Zhang, J.P.; Yu, F.Q. A Natural-Language-Based Approach to Intelligent Data Retrieval and Representation for Cloud BIM. Comput.-Aided Civ. Infrastruct. Eng. 2016, 31, 18–33. [Google Scholar] [CrossRef]
  5. ISO 10303-22:1998; Industrial Automation Systems and Integration—Product Data Representation and Exchange—Part 22: Implementation Methods: Standard Data Access Interface. International Organization for Standardization (ISO): Geneva, Switzerland, 1998.
  6. Hjelseth, E.; Nisbet, N. Capturing normative constraints by use of the semantic mark-up rase methodology. In CIB W78-W102 2011 28th International Conference—Applications of IT in the AEC Industry; Blackwell Publishing Ltd.: Sophia Antipolis, Valbonne, France, 2011; pp. 241–250. [Google Scholar]
  7. Beach, T.; Rezgui, Y.; Li, H.; Kasim, T. A rule-based semantic approach for automated regulatory compliance in the construction sector. Expert Syst. Appl. 2015, 42, 5219–5231. [Google Scholar] [CrossRef] [Green Version]
  8. Sydora, C.; Stroulia, E. Rule-based compliance checking and generative design for building interiors using BIM. Autom. Constr. 2020, 120, 103368. [Google Scholar] [CrossRef]
  9. Zhong, B.T.; Ding, L.Y.; Luo, H.B.; Zhou, Y.; Hu, Y.Z.; Hu, H.M. Ontology-based semantic modeling of regulation constraint for automated construction quality compliance checking. Autom. Constr. 2012, 28, 58–70. [Google Scholar] [CrossRef]
  10. Zhou, P.; El-Gohary, N.M. Ontology-based automated information extraction from building energy conservation codes. Autom. Constr. 2017, 74, 103–117. [Google Scholar] [CrossRef]
  11. Liu, K.; El-Gohary, N.M. Ontology-based semi-supervised conditional random fields for automated information extraction from bridge inspection reports. Autom. Constr. 2017, 81, 313–327. [Google Scholar] [CrossRef]
  12. Al Qady, M.; Kandil, A. Concept relation extraction from construction documents using natural language processing. J. Constr. Eng. Manag. 2010, 136, 294–302. [Google Scholar] [CrossRef]
  13. Zhang, J.; El-Gohary, N.M. Semantic NLP-based information extraction from construction regulatory documents for automated compliance checking. J. Comput. Civ. Eng. 2016, 30, 04015014. [Google Scholar] [CrossRef] [Green Version]
  14. Song, J.; Lee, J.K.; Choi, J.; Kim, I. Deep learning-based extraction of predicate-argument structure (PAS) in building design rule sentences. J. Comput. Des. Eng. 2020, 7, 563–576. [Google Scholar] [CrossRef]
  15. Wang, Z.; Wang, Y.; Gao, K. A new model of document structure analysis. In Lecture Notes in Computer Science—Fuzzy Systems and Knowledge Discovery; Wang, L., Jin, Y., Eds.; Springer: Berlin/Heidelberg, Germany, 2005; pp. 658–666. [Google Scholar]
  16. Eastman, C.M.; Lee, J.m.; Jeong, Y.S.; Lee, J.K. Automatic rule-based checking of building designs. Autom. Constr. 2009, 18, 1011–1033. [Google Scholar] [CrossRef]
  17. Jiang, S.; Wang, N.; Wu, J. Combining BIM and Ontology to Facilitate Intelligent Green Building Evaluation. J. Comput. Civ. Eng. 2018, 32, 04018039. [Google Scholar] [CrossRef]
  18. Zhou, P.; El-Gohary, N.M. Semantic information alignment of BIMs to computer-interpretable regulations using ontologies and deep learning. Adv. Eng. Inform. 2021, 48, 101239. [Google Scholar] [CrossRef]
  19. Ma, Z.; Li, H.; Shen, Q.P.; Yang, J. Using XML to support information exchange in construction projects. Autom. Constr. 2004, 13, 629–637. [Google Scholar] [CrossRef]
  20. Kim, B.G.; Park, S.I.; Kim, H.J.; Lee, S.H. Automatic extraction of apparent semantic structure from text contents of a structural calculation document. J. Comput. Civ. Eng. 2010, 24, 313–324. [Google Scholar] [CrossRef]
  21. Park, S.I.; Lee, S.H. Heuristic solution using decision tree model for enhanced XML schema matching of bridge structural calculation documents. Front. Struct. Civ. Eng. 2021, 14, 1403–1417. [Google Scholar] [CrossRef]
  22. Choi, J.; Choi, J.; Kim, I. Development of BIM-based evacuation regulation checking system for high-rise and complex buildings. Autom. Constr. 2014, 46, 38–49. [Google Scholar] [CrossRef]
  23. Opitz, F.; Windisch, R.; Scherer, R.J. Integration of document - and model-based information for project management support. Procedia Eng. 2014, 85, 403–411. [Google Scholar] [CrossRef] [Green Version]
  24. Park, S.I.; Lee, S.H.; Almasi, A.; Song, J.H. Extended IFC-based strong form meshfree collocation analysis of a bridge structure. Autom. Constr. 2020, 119, 103364. [Google Scholar] [CrossRef]
  25. buildingSMART International MSG. Industry Foundation Classes Release 4.0.2.1. 2017. Available online: https://standards.buildingsmart.org/IFC/RELEASE/IFC4/ADD2_TC1/HTML/ (accessed on 2 July 2021).
  26. buildingSMART International MSG. Industry Foundation Classes Release 4.3RC4. 2021. Available online: https://github.com/bSI-InfraRoom/IFC-Documentation/tree/main/4_3_0_0/rc4 (accessed on 3 July 2021).
  27. Autodesk. Revit—Multidisciplinary BIM Software for Higher-Quality, Coordinated Designs. 2021. Available online: https://www.autodesk.com/products/revit/overview (accessed on 2 July 2021).
  28. World Wide Web Consortium (W3C). DOM. 2021. Available online: https://dom.spec.whatwg.org/#what (accessed on 18 July 2021).
  29. Kleene, S.C. Representation of Events in Nerve Nets and Finite Automata. U.S. Air Force. 1951. Available online: https://www.rand.org/content/dam/rand/pubs/research_memoranda/2008/RM704.pdf (accessed on 20 December 2021).
  30. World Wide Web Consortium (W3C). XML Path Language (XPath) 3.1. 2017. Available online: https://www.w3.org/TR/2017/REC-xpath-31-20170321/ (accessed on 10 July 2021).
  31. Costin, A.; Adibfar, A.; Hu, H.; Chen, S.S. Building Information Modeling (BIM) for transportation infrastructure—Literature review, applications, challenges, and recommendations. Autom. Constr. 2018, 94, 257–281. [Google Scholar] [CrossRef]
  32. Merenda, M.; Praticò, F.G.; Fedele, R.; Carotenuto, R.; Corte, F.G.D. A Real-Time Decision Platform for the Management of Structures and Infrastructures. Electronics 2019, 8, 1180. [Google Scholar] [CrossRef] [Green Version]
Figure 1. Document translation process for text information.
Figure 1. Document translation process for text information.
Applsci 12 02510 g001
Figure 2. Algorithm for content analysis to extract document structure.
Figure 2. Algorithm for content analysis to extract document structure.
Applsci 12 02510 g002
Figure 3. Mapping relationship between a plain and XML document.
Figure 3. Mapping relationship between a plain and XML document.
Applsci 12 02510 g003
Figure 4. Module developed to translate plain document to XML document.
Figure 4. Module developed to translate plain document to XML document.
Applsci 12 02510 g004
Figure 5. Performance evaluations based on precision, recall, GAH, PAH, GAA, and PAA.
Figure 5. Performance evaluations based on precision, recall, GAH, PAH, GAA, and PAA.
Applsci 12 02510 g005
Figure 6. EXPRESS-G diagram to manage bridge model and document meta data; the shaded boxes denote extended entities for the bridge structure.
Figure 6. EXPRESS-G diagram to manage bridge model and document meta data; the shaded boxes denote extended entities for the bridge structure.
Applsci 12 02510 g006
Figure 7. Module developed for connecting 3D model, IFC objects, and engineering document: (a) 3D bridge model, (bd) user-interfaces for the IFC-based bridge modeling, and (e) user-interface for document fragment retrieved based on query.
Figure 7. Module developed for connecting 3D model, IFC objects, and engineering document: (a) 3D bridge model, (bd) user-interfaces for the IFC-based bridge modeling, and (e) user-interface for document fragment retrieved based on query.
Applsci 12 02510 g007
Figure 8. PBS document (shown partially) for mapping between 3D geometry object and IFC entity.
Figure 8. PBS document (shown partially) for mapping between 3D geometry object and IFC entity.
Applsci 12 02510 g008
Figure 9. iMapDoc (shown partially) connecting 3D model object, IFC data, and engineering document fragment.
Figure 9. iMapDoc (shown partially) connecting 3D model object, IFC data, and engineering document fragment.
Applsci 12 02510 g009
Figure 10. Geometric information to represent the 3D object in IPF.
Figure 10. Geometric information to represent the 3D object in IPF.
Applsci 12 02510 g010
Figure 11. Process for retrieving document fragments from structured document in 3D model view.
Figure 11. Process for retrieving document fragments from structured document in 3D model view.
Applsci 12 02510 g011
Figure 12. Query process for extracting specific nodes from structured engineering document in 3D model view.
Figure 12. Query process for extracting specific nodes from structured engineering document in 3D model view.
Applsci 12 02510 g012
Figure 13. Data (shown partially) of retrieved structured engineering document related to Concrete_Pier object.
Figure 13. Data (shown partially) of retrieved structured engineering document related to Concrete_Pier object.
Applsci 12 02510 g013
Figure 14. Information flow among BIM authoring tool–IFC–engineering document.
Figure 14. Information flow among BIM authoring tool–IFC–engineering document.
Applsci 12 02510 g014
Figure 15. IPF (shown partially) generated using integrated assistant module related to Pier_2-4 object.
Figure 15. IPF (shown partially) generated using integrated assistant module related to Pier_2-4 object.
Applsci 12 02510 g015
Table 1. Overview of related research.
Table 1. Overview of related research.
Previous StudyExtraction of 3D Model Info.Extraction of Doc. Info.Integration, Mapping MethodLimitation
Hjelseth and Nisbet [6]XRASE 1 methodologyNot proposedIntegration method between 3D model and doc. Info. was not proposed.
Zhong et al. [9]XCQIEOntology 2 (Manual)Not proposedIntegration method between 3D model and doc. Info. was not proposed.
Choi et al. [22]IFCXIFC user-defined property sets (PSETs)Regulation codes must be mapped into IFC PSETs manually.
Opitz et al. [23]IFC schemaXSQL and BIMfit Model QueryThe document content should already be stored in DB in a fragile state.
Beach et al. [7]IFCExtended RASE (RASE + XML tag)Experts performed the mapping between the code fragments and IFC entities.Mapping was performed manually.
Zhou and El-Gohary [10]IFCRule-based OBIE 3 algorithmIFC–SIE 4–logic facts transformationOBIE algorithm highly depends on specific knowledge domain (building energy conservation codes).
Sydora and Stroulia [8]IFCRule Language (manual)IFC-based automatic mappingThe information should be organized to interpret into rule language manually.
1 RASE: requirement, applicability, selection, and exceptions. 2 CQIEOntology: quality inspection and evaluation ontology. 3 OBIE: ontology-based information extraction. 4 SIE: semantic information element.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Park, S.I.; Kim, B.-G.; Goh, W.; Zi, G. Development of Open-Assistant Environment for Integrated Operation of 3D Bridge Model and Engineering Document Information. Appl. Sci. 2022, 12, 2510. https://doi.org/10.3390/app12052510

AMA Style

Park SI, Kim B-G, Goh W, Zi G. Development of Open-Assistant Environment for Integrated Operation of 3D Bridge Model and Engineering Document Information. Applied Sciences. 2022; 12(5):2510. https://doi.org/10.3390/app12052510

Chicago/Turabian Style

Park, Sang I., Bong-Geun Kim, Wonhui Goh, and Goangseup Zi. 2022. "Development of Open-Assistant Environment for Integrated Operation of 3D Bridge Model and Engineering Document Information" Applied Sciences 12, no. 5: 2510. https://doi.org/10.3390/app12052510

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop