Testing the Capability of Low-Cost Tools and Artificial Intelligence Techniques to Automatically Detect Operations Done by a Small-Sized Manually Driven Bandsaw

Cheţa, Marius; Marcu, Marina Viorela; Iordache, Eugen; Borz, Stelian Alexandru

doi:10.3390/f11070739

Open AccessArticle

Testing the Capability of Low-Cost Tools and Artificial Intelligence Techniques to Automatically Detect Operations Done by a Small-Sized Manually Driven Bandsaw

Department of Forest Engineering, Forest Management Planning and Terrestrial Measurements, Faculty of Silviculture and Forest Engineering, Transilvania University of Brasov, Şirul Beethoven 1, 500123 Brasov, Romania

^*

Author to whom correspondence should be addressed.

Forests 2020, 11(7), 739; https://doi.org/10.3390/f11070739

Submission received: 1 June 2020 / Revised: 3 July 2020 / Accepted: 4 July 2020 / Published: 7 July 2020

(This article belongs to the Section Wood Science and Forest Products)

Download

Browse Figures

Versions Notes

Abstract

Research Highlights: A low-cost experimental system was developed to enable the production monitoring of small-scale wood processing facilities by the means of sensor-collected data and the implementation of artificial intelligence (AI) techniques, which provided accurate results for the most important work operations. Background and Objectives: The manufacturing of wood-based products by small-scale family-held business is commonly affected by a lack of monitoring data that, on the one hand, may prevent the decision-making process and, on the other hand, may lead to less technical efficiency that could result in business failure. Long-term performance of such manufacturing facilities is limited because data collection and analysis require significant resources, thus preventing the approaches that could be pursued for competitivity improvement. Materials and Methods: An external sensor system composed of two dataloggers—a triaxial accelerometer and a sound pressure level meter—was used in combination with a video camera to provide the input signals and meta-documentation for the training and testing of an artificial neural network (ANN) to check the accuracy of automatic classification of the time spent in operations. The study was based on a sample of ca. 90 k observations collected at a frequency of 1 Hz. Results: The approach provided promising results in both the training (ca. 20 k) and testing (ca. 60 k) datasets, with global classification accuracies of ca. 85%. However, the events characterizing the effective sawing, which requires electrical power, were even better recognized, reaching a classification accuracy of 98%. Conclusions: The system requires low-cost devices and freely available software that could enable data feeding on local computers by their direct connection to the devices. As such, it could collect, analyze and plot production data that could be used for maintaining the competitiveness of traditional technologies.

Keywords:

external sensor system; artificial intelligence; production monitoring; wood processing; manually driven bandsaws; improvement; competitiveness

1. Introduction

A sustainable provision of high-quality wood-based products to the market requires a supply chain designed to overcome bottlenecks and to meet an efficient allocation of commodities and resources flows. It should enable a diversified and customized offer as well as the resilience of different categories of stakeholders involved in it. From the forest to the customers, the wood supply chain follows a path that links several stakeholders, operations, logistics and transactions [1]. As such, it is now more than ever, that people working at different management levels need high amounts of reliable data to plan logistics and to make business decisions, because the failure of one component in the chain may lead to the failure of the entire chain. While the supply chain from the forest to the mill may be characterized by several weaknesses in different regions [2,3], a typical bottleneck is that related to the sawmilling industry and in particular to the small, family-held enterprises. On the one hand, they have formed for themselves a small segment of customers and, on the other hand, they are less adapted and vulnerable to rapid changes in the market, such as the rise in the price of raw materials [4]. Nevertheless, in many parts of the world, such businesses bring positive contributions to rural job markets and help in building the local economy.

In comparison to the modern sawmills that are transforming to resemble continuously running processes [5], small-scale sawmills are typically established as business running at lower or part-time rates; they are under pressure to optimize their production outputs and to save resources, under the so-called “sawmill paradox” [6]. From this point of view, to stay competitive, measures are needed to enhance their efficiency of production, which refers to both technical and allocative efficiency [7]. Nevertheless, this attempt is often compromised due to missing production efficiency data, which forms the basis of process control [5] and improvement. Data unavailability, on the other hand, may be due to the inability of such businesses to collect and interpret it. This is because they commonly focus on financial aspects and struggle to survive in a very competitive market [6]. They do not hold specialized departments to deal with data analytics and, as such, they could be seen as closed information systems, releasing in the mainstream only declarative data on their technical production capacity [8]. External data collection and analysis, on the other hand, is challenging because the output rates of small-scale sawmilling operations are typically low. Therefore, one needs to spend important resources of time to collect it and, at the end of the day, the analysis will provide only the information which is indicative for the time in which it has been collected. Most probably, this is also a reason for the limited effort spent in researching such operations since only couple of studies (e.g., [9,10,11,12,13]) were identified on the topic in the field.

One way to help such companies to survive and develop is to provide them with the tools needed to collect and analyze their own production data or to find cheap ways for such an attempt. To do so, formally, the technical efficiency may be addressed under the umbrella of work studies [14] which can use different instruments and approaches, starting from pen-and-paper methods and ending with the implementation of sensor systems. These approaches are characterized by different levels of accuracy and amounts of resources needed. What is most evident, is the fact that pen-and-paper studies have limited capabilities in gathering and analyzing long-term data, while sophisticated production monitoring systems are well beyond the financial capability of such establishments. An ideal solution would be an approach that is both inexpensive and capable of collecting and analyzing long-term data.

Collecting data automatically may mean that one could use video cameras to monitor the operations. However, the approach has been found to be resource-intensive in the office phase, judging by the amount of time needed to process and analyze the data (e.g., [15,16,17]), while it cannot automate the effort of analysis. A mid-path would be affordable dataloggers equipped with sensors able to capture signals [18] typical in such operations that carry enough information to enable accurate classification. These data would then form the input for analysis, classification, and, at a later time, decision making. Sound pressure level and acceleration sensors are examples of inexpensive devices that have been previously used to automatically collect information in several applications (e.g., [19,20,21,22,23]). However, the resulting signals lack the self-analytical ability, a reason for which other approaches are needed to extract meaningful information. As such, a common approach to learn from signals and classify the events is the utilization of artificial intelligence (AI) techniques, which have gained scientific attention in forestry both in event classification (e.g., [24]) and solving multivariate quantitative and qualitative problems (e.g., [25]). Among these, a class of nonlinear learning techniques is artificial neural networks (ANN), which can solve multivariate nonlinear classification problems [26] at high efficiency return rates.

Globally, there are many models of saws used to process the wood, which are available in different sizes and levels of technology integrated into them. A typical difference between them rests in their capability to automatically monitor the production, with some which are integrating such functions while some are much simpler by construction and do not hold such capabilities. In addition, some manufacturers provide production monitoring systems as an option that comes at supplementary costs, and many entrepreneurs working in the industry do not purchase them due to their limited financial availability and cost-saving reasons. For small-scale business, it is quite typical to use a lower-level technology in such operations and, in addition, a lot of the equipment used could be manually driven, even though they are electrically powered. As such, the situation prevents extensive data gathering on their performance, which limits understanding of the factors that affect or drive it. In addition, in well-established industries, the analysis of big data is of crucial importance to make decisions and to balance resources, a fact that applies also to small entrepreneurs.

This work is experimental in nature and it aimed to test whether it is possible to use triaxial accelerometers and sound pressure level sensors as low-cost data collectors to monitor the production of a simple small-sized locally manufactured bandsaw held by a small-scale family business. The concept behind the work was that of documenting the signals produced by the two types of sensors, which was completed by the use of video surveillance, followed by the use of techniques of artificial intelligence (AI) to train and test an artificial neural network (ANN) and to see to what extent the signals could be used to monitor the production. The choice of the bandsaw was based also on the model’s wide use in the small business from the region.

2. Materials and Methods

2.1. Facility Description and Machine’s Functions

The data needed in this study were collected in a small-scale family-held wood-processing facility located in Harghita county (Romania), in 2018. A full description of the facility is given in Figure 1, along with the main inputs and outputs of the production and the machines used in the sawmilling operations.

The sawmilling machine is manually driven and adjusted, requires one worker to operate it and it is electrically powered. A common feature of machines from this class is, however, the technical functions they enable, irrespective of whether or not they are mechanically or manually operated. The whole range of such machines provide cutting functions that are used to detach parts from the logs by a forward-backward movement of the cutting frame, which is supported by the possibility of vertical adjustment. The later enables one to set the cutting thickness at the desired dimensions and the cutting blade rising–lowering to accommodate them. For the machine observed in this study, only the active feeding of the blade into the logs was electrically powered since the worker that operated the machine turned on the engine only for this phase of sawmilling. The rest of work elements were manually powered and consisted of forward and backward movements of the cutting frame as well as of frame adjustments on the height. As a rule, these were completed with the engine turned off.

2.2. Data Collection and Processing

Data collection was completed by the use of three devices. An Extech^® 407760 sound level meter and an Extech^® VB 300 triaxial accelerometer (Extech Instruments, FLIR Commercial Systems Inc., Nashua, NH, USA) were used to collect the raw input signals (S—sound pressure level, dB(A) and A—acceleration, g) used in this work. They were set up to collect observations at a sampling rate of 1 Hz. The accelerometer was mounted on the machine’s frame while the sound pressure level datalogger was mounted on the worker’s helmet to enable also the collection of data on exposure to noise. However, exposure to noise was not addressed in this study. The full procedures used for setting, data transfer and data pairing, as well as the capabilities and dimensional features of the used devices are described in [27] and [21,22,23], respectively. A small-sized Schwartz B1080 video camera was placed on a wall of the facility to cover in the field of view the operations; it was set up to monitor the operations by continuously collecting video files at the maximum length enabled by it (20 min), and the operations were surveyed for three working days by recording and saving the data on internal memory.

Back at the office, the data collected by the first two devices were organized in a Microsoft Excel^® (Microsoft, Redmond, WA, USA, 2013 version) sheet; then, the video footage was used to document it by considering three types of events: cutting (C), moving (M) and pauses (P). To do so, string codes were used in conjunction with the video files played at low speed and each observation received a code (C, M or P) depending on the event to which it was identified to belong, based on the video analysis. As such, cutting (hereafter Cut) covered those observations in which the engine was on and the blade engaged into active cutting. Moving (hereafter Move) corresponded to all the events which supposed the movement of the cutting frame (forward feed, backward, vertical adjustment) without having the engine on, and pauses (hereafter Pause) consisted of events in which there was no intention to operate the machine observed, but still, the worker was near it. At this point, some parts of the initial dataset needed to be removed to cover only those events restricted to the machine use, as described in Table 1. In addition, Table 1 shows the input signals and their purpose in the framework of this study. For the acceleration signal, this study used a normalization procedure which aimed to enhance the independence of the datalogger orientation in the three-dimensional space. As such, vector magnitudes (g) were used as input signals instead of the axial responses, a procedure that is easy because the datalogger used outputs this signal derivation. In the case of sound pressure level data (dB(A)), for graphical comparison purposes, it was decided to use the datalogger’s output signal values divided by a factor of 10. However, this does not alter the pattern of the original signal, therefore, the outcomes of the training and testing algorithms are also unaltered. In total, 78,189 observations were retained for the training (20,050) and testing (58,139) of the ANN. Before doing so, however, a median filtering procedure using a window size of 3 points (observations) was applied to remove the impulse noise and some data collection errors (see Figure 2, left side). The choice of this filter was due to its ability to preserve the edges of the signals in the time domain (e.g., [28,29]).

2.3. Setup of the Artificial Neural Network

The freely available Orange Visual Programming Software (version 3.2.4.1) [30] was used when setting up the ANN for training and testing. The rectified linear unit function (ReLu) was adopted as an activation function because it is assumed to solve nonlinear problems at high performances (e.g., [31,32]); however, it is worth mentioning that the used software enables the implementation of the most common activation functions. Adam solver (the stochastic gradient-based optimizer) was chosen and used mainly due to its low training costs [33], and the L2 penalty regularization term was set at 0.0001. Then, A_MTRAIN, S_MTRAIN and AS_MTRAIN signal datasets (see definitions in Table 1) were used to train the ANN and to produce the performance indicators needed to check which one of the signals was the best. The indicators used to check the ability of the signals to train the ANN, as well as to evaluate the performance of the ANN-developed model on the test signals, were those commonly described in similar studies [34,35], from which the area under the curve (AUC), classification accuracy (CA), precision (PREC) and recall (REC) were retained and used as a reference in this study. In order to do so, the ANN was set up to hold three hidden layers of 100 neurons each and to run 1,000,000 iterations for each train signal dataset. The setup of the ANN training, as described above, was rather an educated guess that tried to maximize the performance of the ANN in testing at the expense of computational cost; for this reason, the time needed to train the ANN on the three signals was also counted. As a fact, choosing the number of hidden layers and neurons is seen more like an art than a science; even though there are some methods described in the available literature which propose criteria for choosing the number of neurons and hidden layers [36,37], to the best of our knowledge, finding the means to provide the best practices for given cases is a problem that is yet to be solved. Then, training and scoring was completed by cross-validation assuming a stratified approach and a number of folds set at 20. After the training procedure, the performance metrics were evaluated and the best model was saved for further use in the testing phase which was applied to its corresponding test signal dataset. Based on the tested data, the analysis of findings went in more detail to see which events and to what amount were correctly classified, as well as to see which of them were misclassified as other events. For that, data was imported from the software into Microsoft Excel^® (Microsoft, Redmond, WA, USA, 2013 version) and a detailed analysis was carried out at event type and classification outcome levels. Since the refined signals were used in training and testing, for balancing purposes, an analysis was carried out to see the proportion of the events in the time domain of the refined, training and testing signals. All the supplementary analyses as described above, as well as the basic statistics of the refined signals and the artwork shown in the results, were carried out or produced in the Microsoft Excel^® (Microsoft, Redmond, WA, USA, 2013 version) software.

The computer architecture on which the ANN was setup, trained and tested had the following parameters: system type—Alienware 17 R3, processor—Intel^® Core™ (Intel, Santa Clara, CA, USA) i7-6700 HQ CPU, 2.60 GHz, 2592 MHz, 4 cores, 8 Logical Processors, installed physical memory (RAM)—16 GB, operating system—Microsoft Windows 10 Home. However, one should note that the training phase is that which is the most computationally intensive, and once a model is settled, the testing phase takes much less time.

3. Results

3.1. Descriptive Statistics of the Refined Signal Datasets

Figure 2 shows a partition of the refined signal datasets plotted against the true events which were documented in the time domain. At a first glance, Cut events could be identified quite easily in the S_REF signal (see definition in Table 1) by visual means, a fact that was generally true for all the dataset utilized in this analysis. A_REF (see definition in Table 1), on the other hand, has provided less separability in its pattern, assuming here a linear approach. These two phenomena may be explained by the process physics and mechanics involved in the observed events. In the case of the sound pressure level signal, the separability of Cut events was enhanced by the higher and steadier noise level produced by the interaction of the blade with the wood during such events. As such, the outputs of this signal produced fewer variable outputs in the amplitude domain. In the case of acceleration, however, it seems that movement of the cutting frame in events such as the Cut and Move interfered with the outputs, providing less linear separability. Since the placement of the accelerometer was on the frame, other external events could have been also affecting its outputs, a fact that was true also for the sound pressure level datalogger, but to a lesser extent. From this point of view, the information carried by the sound may provide better results compared to that provided by the acceleration.

The basic statistics of the two refined signal datasets have revealed some important information that could be used to judge the separability of data and to justify the need to filter it. Even if not given as a table here, the minimum, maximum, mean and standard deviation values are presented. For instance, in the case of A_REF, Cut events were characterized by a range of values between 1.01 and 3.78 g, averaging 1.16 ± 0.12 g; the same statistics were 1.01 to 3.51 and 1.17 ± 0.11 for the Move events, and 1.01 to 4.98 and 1.14 ± 0.11 g for the Pause events, respectively. In the case of S_REF, they were 5.09 to 10.19 and 8.48 ± 0.48 dB(A)/10 for the Cut events, 2.82 to 10.02 and 6.26 ± 0.73 dB(A)/10 for the Move events, and 0.01 to 10.57 and 4.83 ± 1.42 dB(A)/10 for the Pause events, respectively. As such, it is obvious that the refined signal datasets provided less information assuming at least a linear separability, even though the frequencies of the observations on magnitude categories were not documented in this study. Part of these effects, reflected in the main statistical descriptors (i.e., minimum value of 0.01 for S_REF in case of Move), were also due to some impulse noise or measurement and recording errors, as shown in Figure 2, on the left side in the case of S_REF and in the central part in the case of A_REF.

The analysis of the true events shares in the used signals revealed the results shown in Table 2. While it is typical for many applications of ANN learning techniques to use a higher proportion of the dataset to train the model, in this work only ca. 25% of the data was used to train the ANN, based on the assumption that the computational effort should be kept to a minimum.

As shown in Table 2, the proportions of the true events encoded in the used signal datasets were similar, providing a good balancing of the data used in different steps of the ANN implementation. They also reflect the proportion of time used in operations that may characterize the efficiency of production, which is typical to small-scale facilities. As shown, close to 70% of the time was used for different pauses, and only ca. 30% for operations. Of the later, only ca. 20% of the time was used in effective cutting. In these conditions, it is quite usual for such facilities to process less than 10 m³ per day, with a usual daily input of ca. 5 m³.

3.2. Training Results and Selection of the Model

The results of the ANN training phase, which has used the three median-filtered input signals, are given in Table 3. Using both the acceleration and sound pressure level median-filtered signals (AS_MTRAIN) in the training process needed ca. 2.8 times more time resources compared to using only the acceleration (A_MTRAIN), and ca. 2 times more time resources compared to using only the sound pressure level (S_MTRAIN).

The area under the curve (AUC) stands for a metric which is often used to characterize the performance of a classifier in the area of receiver operating characteristics (ROC) graphs and it is equivalent to the probability that a classifier will rank a randomly chosen positively instance higher than a randomly chosen negative instance (e.g., [35]). It is also often assumed that the higher the AUC, the better the performance of a classifier. From this point of view, and by judging at the train signal datasets level, AS_MTRAIN provided the best results, while A_MTRAIN provided the poorest ones. Considering the same scale, S_MTRAIN was close in performance to AS_MTRAIN, with a difference in AUC of 0.005. At the event level, however, it seems that for the Move event, the AUC results were the poorest.

Classification accuracy (CA) is a metric that characterizes the percentage of correct predictions where a class that has the highest probability is the same as the targeted one [34], being interpreted as a metric of true classification [38]; it is averaged among all the classes specific to a problem [34] and stands for the ratio of true positive and negative values to the total of the observed values [34,35]. Based on the results shown in Table 3, the situation in regard to CA was similar to that of AUC, indicating a better performance of AS_MTRAIN, which was comparable to that of S_MTRAIN. However, at the event (class) level, the poorest results were those associated with Pause events. Precision (PREC), or the positive predicted values, accounts for the fraction of the instances identified by a classifier as true positives within the total number of positively classified instances [35]. For multi-class problems, the precision is calculated by averaging among the classes [34]. At the train signal datasets level, the situation was similar to the AUC and CA metrics, with AS_MTRAIN resulting in a better performance and the Move event showing the poorest one. In the A_MTRAIN signal, however, PREC performed very low, by “switching” the metrics between the Move and Cut events. Therefore, this signal did not provide enough information for this attempt.

Recall (REC), as a classification performance metric, stands for the fraction of true positives from the total amount of true positives and false negatives [34,35], which is also sometimes called and used as the true positive rate, hit rate or sensitivity of a classifier [35,38], being averaged in the case of multiclass problems [34]. As such it stands for the ratio of hypothesized positives to the total positives in a sample and, because of that, it may be the best indicator for efficiency-monitoring applications. As shown in Table 3, the situation on REC kept AS_MTRAIN as the best signal in the training phase, being closely followed by S_MTRAIN. The overall REC for AS_MTRAIN, however, reached only ca. 87%. Nevertheless, it provided quite accurate results for Pause and Cut (ca. 95%), but it performed poorer for Move (ca. 30%). In the case of A_MTRAIN, the results were the poorest, showing the signal’s inability to provide the information for a differentiation between the events. As such, all the data were recalled as belonging to the Pause event. The results of S_MTRAIN, on the other hand, showed a similar behavior to that of AS_MTRAIN for the REC metric, even though they were less accurate. The F1 metric stands for the harmonic mean of PREC and REC [34], and it is not discussed here but provided as a reference.

3.3. Statistics and Classification Performance on the Test Signal Dataset

Since AS_MTRAIN provided the best results for the performance metrics observed in this study, the testing phase of the ANN was implemented on the corresponding test signal dataset (AS_MTEST). The overall performance of the test signal dataset was characterized by an AUC of 0.939, a CA of 0.849, a PREC of 0.832 and a REC of 0.849 (Table 4). In the total number of correctly classified observations, the proportions of Cut, Move and Pause were of ca. 23, 5 and 72%, respectively.

However, 8773 observations from the AS_MTEST signal were incorrectly classified (Table 5), accounting for ca. 15%. In this subsample, the biggest inaccuracy problem seemed to be that of misclassifying Move events as Pause events (ca. 54%). This was followed by the misclassification of Pause events as Move events (ca. 26%) and by Pause events as Cut events (ca. 12%). Only 416 true Cut observations were misclassified as Pause (ca. 4%) or Move (ca. 1%) events.

By considering the data from Table 4 and Table 5, the REC of Cut was evaluated in the test set at ca. 96%, that of Move at ca. 39% and that of Pause at 91.5%. The results that are similar to those shown in Table 3 are for the REC calculated for AS_MTRAIN. Ultimately, it seems that the system provided good classification outcomes. This can be observed for the Cut and Pause events, for which the recall metric provided very good results, with their share in the train and test dataset accounted for the majority (Table 2).

4. Discussion

This study tested the possibility of implementing an external sensor system to automatically collect relevant operational data coupled with the techniques of ANN to enhance the automation of data classification and analytics. The applicability of the system is obvious in the context provided in the introduction section. One thing to be addressed is related to the factors that could favorize or prevent its internal implementation in such facilities. From this point of view, at least for the applied science, the system may provide a useful tool to externally monitor sawmilling operations in small-scale companies and to produce and analyze big datasets. This is proven by the approach and results of this study, which have demonstrated the utility of the system for such attempts. In such a case in which there is willingness to implement it internally, some points need to be addressed. The first one would be that of the system’s components cost. As such, the investment in the dataloggers used reaches the amount of EUR 450, being quite affordable from this point of view. Under the assumption of a well-designed ANN model, the camera could be excluded from the investment, while it is quite typical for many people to already hold a personal computer running the Microsoft Office Pack. Then, if a full automation of data analytics is in question, one should think about the connectivity of the dataloggers to a computer platform as well as to the additional software or routines needed there. Since the software used to run some data analysis was Microsoft Excel, and based on the fact that the ANN model and its figures on probabilities could be moved to Microsoft Excel, the only problem that would need to be solved is that of building some routines to bring and merge the signal data from the dataloggers’ software into Microsoft Excel files and to also automatically run the model in real time for classification purposes. This would also enhance the use of dataloggers at finer sampling rates. Since Microsoft Excel enables the use of routines external to its environment [39], this approach would be achievable at rather low costs.

ANN, as well as other classes of AI techniques, were widely used for multivariate classification problems in various fields of research and practice. Recent results show their good performance for both classification [24] and regression [25] problems in forestry as well in other related fields [34], with classification accuracies of over 90% termed generally as very good performances [34]. Judging the results of this study by this metric, the system tested has shown a very good classification performance with the classification accuracy reaching almost 90% irrespective of the event observed in the study; it was also close to 100% for Cut events in the training phase, which could be seen as excellent. Nevertheless, the performance of classification is still dependent on the complexity of phenomenon surveyed and on the quality of the information carriers [34]. For the typical case of ANN use, some have found very good classification performances, while others have found average or less performant ones [34,38]. Thus, some features may be more or less recognized by the models [38], depending also on their complexity, the chosen AI techniques and their ability to learn. Nevertheless, the classification performance ultimately needs to be related to the intended uses of the models. As such, even if the share of Cut events in this study was low, it is still important because the machine used electrical power in this event, and it is highly suggestive of the technical efficiency of the machine. Since the REC rate for Cut was close to 96%, the results could be interpreted as very good; still, this will mean that from each one hour of effective cutting, close to 2.5 min of events will be misclassified, which could have an impact on results scaled to longer periods.

The described outcomes were related, in this study, to the information carrier, because it seems that in the training phase, the sound pressure level signal yielded classification performances, which were comparable with those of using both signals. It was probably equally difficult for the ANN to accurately learn movement events from the sound information carrier as it was to learn it from the acceleration information carrier. This is because the movement events were completed at similar speeds, while the location of the accelerometer enabled it to collect a general magnitude that was not sufficiently separable between the events. In the case of sound, it is possible for Pause and Move events to generate similar patterns in the signal, and only Cut is able to be more distinguishable. As the most contribution in the classification performance came from the sound carrier, it worth mentioning that close to 80% of the misclassifications were those confusing movements with pauses and vice versa. Therefore, the implications are evident, advocating for choosing better locations for the dataloggers. One may speculate that in a configuration that would suppose the use of both collectors in real applications, the best choice would be to place the sound pressure level datalogger as close as possible to the place in which the interaction between the cutting blade and the log will occur, thus accounting for the highest magnitudes and a better separability of the Cut events. This behavior was observed in the results of this study, even though the collector was placed on the helmet of the worker to be able to collect data regarding exposure to noise. In contrast, the acceleration datalogger should be placed on the lever used to manually operate the machine, thus accounting for a higher magnitude in the signal as a result of the lever movement, and enhancing the separability of Move and Pause events. While these locations should be carefully chosen and standardized, this is an approach that may still require trial and error.This is also related to the type of machine used and to the type and frequency of operations surveyed. Most probably, the machines operating vertically by feeding the logs in the blades will provide the opportunity to use just one data collector. In such cases, the operational complexity could be also lower. Given the types of signals used, there is the possibility of other operations running in parallel (e.g., the use of a chainsaw) to affect the results. However, this could be balanced to some extent by incorporating in the analysis the signals from both dataloggers under the assumption that they would be placed in the best positions. The extent to which impulse noise carries enough useful information in delimiting specific events should be also explored and, as mentioned before, a wise placement of the dataloggers could help in lowering the miss-classification of productive and non-productive time.

Last, but not least, the signal filtering procedure used in this study tried only to remove the impulse noise. Whether or not the use of a repetitive filtering procedure to reach to the root of the signal [29], or the use of a wider window to analyze and probably improve the signal-to-noise ratio, would enhance the separability of events and the ability of the ANN to learn them from the altered signals should be checked in the future. This also applies to the length of the input signals used for training purposes, which accounted for one quarter of the sample used in this study. Probably, the use of more data in the learning process would have been provided much better results, a fact that also needs to be checked since similar studies have typically used ratios as high as 90–10% for learning and testing, respectively [34].

5. Conclusions

Based on the results of this study, the main conclusion is that the tested system holds a promising potential for implementation in real world scenarios to accurately collect, process and analyze big datasets at a low cost, in real time, and under the current limitation that some more work would be needed to connect the loggers with the computer software. Such attempts could be facilitated also by the rapid development of cheap miniaturized data collectors. Under the assumption of an internal implementation, one should find the best locations to place the collectors and should maintain those locations for long-term monitoring of production, a fact that may require the development of completely new ANN models. The implementation of the system, on the other hand, will not only provide the science with the tools and evidence on the real performance of small-sized sawmills, but it will also contribute to a better internal planning, thus enhancing the competitiveness of small companies in the field.

Author Contributions

Conceptualization, M.V.M., E.I. and S.A.B.; data curation, M.C. and S.A.B.; formal analysis, M.C. and S.A.B.; funding acquisition, M.V.M. and E.I.; investigation, M.C.; methodology, S.A.B.; project administration, M.V.M., E.I. and S.A.B.; resources, M.C. and M.V.M.; software, S.A.B.; supervision, S.A.B.; validation, S.A.B.; visualization, M.V.M. and E.I.; writing—original draft, M.V.M., E.I. and S.A.B.; writing—review and editing, S.A.B. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Acknowledgments

The authors acknowledge the technical support of the Department of Forest Engineering, Forest Management Planning and Terrestrial Measurements, Faculty of Silviculture and Forest Engineering, Transilvania University of Brasov in designing and conducting this study. The authors would like to thank the management of Transilvania University of Brasov for providing the funds needed for purchasing the dataloggers used in this study and to Eng. Arpad-Attila Lorincz for facilitating and supporting the data collection. This paper reports partial results of the PhD thesis under development by Eng. Marius Cheţa—The use of sensor systems in work measurement applications implemented in forest engineering, under the coordination of PhD School of the Transilvania University of Brasov.

Conflicts of Interest

The authors declare no conflict of interest.

References

Oprea, I. Tehnologia Exploatării Lemnului; Transilvania Publishing House: Brasov, Romania, 2008; 273p. [Google Scholar]
Rauch, P.; Wolfsmayr, U.J.; Borz, S.A.; Triplat, M.; Krajnc, N.; Klock, M.; Oberwimmer, R.; Ketikidis, C.; Vasiljevic, A.; Stauder, M.; et al. SWOT analysis and strategy development for forest fuel supply chains in South East Europe. Forest Policy Econ. 2015, 61, 87–94. [Google Scholar] [CrossRef]
Rauch, P.; Borz, S.A. Reengineering the Romanian Timber Supply Chain from a Process Management Perspective. Croat. J. For. Eng. 2020, 4, 85–94. [Google Scholar] [CrossRef]
Fornea, M.; Bîrda, M.; Borz, S.A.; Popa, B.; Tomašić, Ž. Harvesting conditions, market particularities or just economic competition: A Romanian case study regarding the evolution of standing timber. Sumar. List 2018, 9–10, 499–508. [Google Scholar]
Lundahl, C.G. Optimized Processes in Sawmills. Licentiate Thesis, Luleå University of Technology, Skellefteå, Sweden, 2007. [Google Scholar]
Grönlund, A. Sågverksteknik del 2—Processen. Sveriges Skogsindustriförbund; Arbio: Markaryd, Sweden, 1992; p. 270. ISBN 91-7322-150-3. [Google Scholar]
Hyytiäinen, A.; Viitanen, J.; Mutanen, A. Production efficiency of independent Finnish sawmills in the 2000′s. Baltic For. 2011, 17, 280–287. [Google Scholar]
Sbera, I. Wood resources and the market potential in Romania. (Resursele de lemn şi potenţialul pieţei din România). Meridiane For. 2007, 2, 3–7. (In Romanian) [Google Scholar]
Gigoraş, D.; Borz, S.A. Factors affecting the effective time consumption, wood recovery rate and feeding speed when manufacturing lumber using a FBO-02 CUT mobile bandsaw. Wood Res. 2015, 60, 329–338. [Google Scholar]
Cedamon, E.D.; Harrison, S.; Herbohn, J. Comparative analysis of on-site free-hand chainsaw milling and fixed site mini-bandsaw milling of smallholder timber. Small-Scale For. 2013, 12, 389–401. [Google Scholar] [CrossRef]
De Lasaux, M.J.; Spinelli, R.; Hartsough, B.R.; Magagnotti, N. Using a small-log mobile sawmill system to contain fuel reduction treatment cost on small parcels. Small-Scale For. 2009, 8, 367–379. [Google Scholar] [CrossRef]
Ištvanić, J.; Lučić, R.B.; Jug, M.; Karan, R. Analysis of factors affecting log band saw capacity. Croat. J. For. Eng. 2009, 30, 27–35. [Google Scholar]
Venn, T.J.; McGavin, R.L.; Leggate, W.W. Costs of portable sawmilling timbers from the acacia woodlands of Western Queensland, Australia. Small-Scale For. Econ. Manag. Policy 2004, 3, 161–175. [Google Scholar] [CrossRef]
Acuna, M.; Bigot, M.; Guerra, S.; Hartsough, B.; Kanzian, C.; Kärhä, K.; Lindroos, O.; Magagnotti, N.; Roux, S.; Spinelli, R.; et al. Good Practice Guidelines for Biomass Production Studies; CNR IVALSA Sesto Fiorentino (National Research Council of Italy—Trees and Timber Institute): Sesto Fiorentino, Italy, 2012; pp. 1–51. ISBN 978-88-901660-4-4. [Google Scholar]
Muşat, E.C.; Apăfăian, A.I.; Ignea, G.; Ciobanu, V.D.; Iordache, E.; Derczeni, R.A.; Spârchez, G.; Vasilescu, M.M.; Borz, S.A. Time expenditure in computer aided time studies implemented for highly mechanized forest equipment. Ann. For. Res. 2016, 59, 129–144. [Google Scholar] [CrossRef]
Borz, S.A.; Adam, M. Analysis of video files in time studies by using free or low-cost software: Factors that quantitatively influence the time consumption data processing and its prediction. Revista Pădurilor 2015, 130, 60–71. [Google Scholar]
Contreras, M.; Freitas, R.; Ribeiro, L.; Stringer, J.; Clark, C. Multi-camera surveillance system for time and motion studies of timber harvesting equipment. Comput. Electron. Agr. 2017, 135, 208–215. [Google Scholar] [CrossRef]
Borz, S.A. Turning a winch skidder into a self-data collection machine using external sensors: A methodological concept. Bull. Transilv. Univ. Braşov 2016, 9, 1–6. [Google Scholar]
Cheța, M.; Borz, S.A. Automating data extraction from GPS files and sound pressure level sensors with application in cable yarding time and motion studies. Bull. Transilv. Univ. Braşov 2017, 10, 1–10. [Google Scholar]
Cheța, M.; Şerban, D.; Ignea, G.; Derczeni, R.A.; Sfeclă, V.; Borz, S.A. Using sound pressure sensors to monitor the performance of manually operated circular saws: What parameters and to what extent can they be inferred? Revista Pădurilor 2017, 132, 15–22. [Google Scholar]
Borz, S.A.; Talagai, N.; Cheţa, M.; Gavilanes Montoya, A.V.; Castillo Vizuete, D.D. Automating data collection in motor-manual time and motion studies implemented in a willow short rotation coppice. Bioresources 2018, 13, 3236–3249. [Google Scholar] [CrossRef]
Borz, S.A.; Talagai, N.; Cheţa, M.; Chiriloiu, D.; Gavilanes Montoya, A.V.; Castillo Vizuete, D.D.; Marcu, M.V. Physical strain, exposure to noise and postural assessment in motor-manual felling of willow short rotation coppice: Results of a preliminary study. Croat. J. For. Eng. 2019, 40, 377–388. [Google Scholar] [CrossRef]
Marogel-Popa, T.; Cheța, M.; Marcu, M.V.; Duță, C.I.; Ioraș, F.; Borz, S.A. Manual cultivation operations in poplar stands: A characterization of job difficulty and risks of health impairment. Int. J. Environ. Res. Public Health 2019, 16, 1911. [Google Scholar] [CrossRef]
Keefe, R.F.; Zimbelman, E.G.; Wempe, A.M. Use of smartphone sensors to quantify the productive cycle elements of hand fallers on industrial cable logging operations. Int. J. For. Eng. 2019, 30, 132–143. [Google Scholar] [CrossRef]
Proto, A.R.; Sperandio, G.; Costa, C.; Maesano, M.; Antonucci, F.; Macri, G.; Scarascia Mugnozza, G.; Zimbalatti, G. A three-step neural network artificial intelligence modeling approach for time, productivity and costs prediction: A case study in Italian forestry. Croat. J. For. Eng. 2020, 41, 35–47. [Google Scholar] [CrossRef]
Haykin, S.S. Neural Networks and Learning Machines; Pearson: Upper Saddle River, NJ, USA, 2009; Volume 3, 26p. [Google Scholar]
Cheţa, M.; Marcu, M.; Borz, S. Workload, exposure to noise, and risk of musculoskeletal disorders: A case study of motor-manual tree feeling and processing in poplar clear cuts. Forests 2018, 9, 300. [Google Scholar] [CrossRef]
Neal, C.G., Jr.; Gary, L.W. A theoretical analysis of the properties of median filters. IEEE Trans. Acoust. Signal Process. 1981, 29, 1136–1141. [Google Scholar]
Leeb, S.B.; Shaw, S.R. Applications of real-time median filtering with fast digital and analog sorters. IEEE/ASME Trans. Mechatron. 1997, 2, 136–143. [Google Scholar] [CrossRef]
Demsar, J.; Curk, T.; Erjavec, A.; Gorup, C.; Hocevar, T.; Milutinovic, M.; Mozina, M.; Polajnar, M.; Toplak, M.; Staric, A.; et al. Orange: Data Mining Toolbox in Python. J. Mach. Learn. Res. 2013, 14, 2349–2353. [Google Scholar]
Maas, A.L.; Hannun, A.Y.; Ng, A.Y. Rectifier nonlinearities improve neural network acoustic models. In Proceedings of the 30th International Conference on Machine Learning, ICML 2013, Atlanta, GA, USA, 16–21 June 2013. [Google Scholar]
Nair, V.; Hinton, G.E. Rectified linear units improve restricted Boltzmann machines. In Proceedings of the 27th International Conference on Machine Learning (ICML 2010), Haifa, Israel, 21–24 June 2010. [Google Scholar]
Kingma, D.P.; Ba, J.L. ADAM: A method for stochastic optimization. In Proceedings of the 3rd International Conference on Learning Representations, ICLR 2015, San Diego, CA, USA, 7–9 May 2015. [Google Scholar]
Kamilaris, A.; Prenafeta-Boldu, F.X. Deep learning in agriculture: A survey. Comput. Electron. Agric. 2018, 147, 70–90. [Google Scholar] [CrossRef]
Fawcett, T. An introduction to ROC analysis. Pattern Recogn. Lett. 2006, 27, 861–874. [Google Scholar] [CrossRef]
Karsoliya, S. Approximating number of hidden layer neurons in multiple hidden layer BPNN architecture. Int. J. Eng. Technol. 2012, 3, 714–717. [Google Scholar]
Panchal, F.S.; Panchal, M. Review on methods of selecting number of hidden nodes in Artificial Neural Network. Int. J. Comput. Sci. Mob. Comput. 2014, 3, 455–464. [Google Scholar]
Nasir, V.; Nourian, S.; Avramidis, S.; Cool, J. Classification of thermally treated wood using machine learning techniques. Wood Sci. Technol. 2019, 53, 275–288. [Google Scholar] [CrossRef]
Borz, S.A.; Ignea, G. Aplicaţii V.B.A. şi M.S. Excel în Ingineria Forestieră; Lux Libris Publishing House: Braşov, Romania, 2013; 398p, ISBN 978-973-131-235-4. [Google Scholar]

Figure 1. Description of wood processing facility. Legend: in white: 1—log feedstock, 2—processed log, 3—machine (bandsaw), 4—processed planks, 5—regular circular saw, 6—processed products, 7—residues; in green: 1—transversal wooden strut, 2—fixing traverse, 3—rolling rail, 4—metallic cover, 5—blade, 6—frame, 7—water tank, 8—lever.

Figure 2. A partition of the refined signal datasets showing the true events in the time domain. Legend: A_REF—refined acceleration signal dataset, S_REF—refined sound pressure level signal dataset.

Table 1. Description of the initial, refined and median filtered input signal data.

Definitions of Signals	Abbreviation	Number of Observations	Purpose/Use
Initial acceleration signal dataset	A_INI	90,405	Reference of the study
Initial sound pressure level signal dataset	S_INI	90,405	Reference of the study
Initial acceleration and sound pressure level signals dataset	AS_INI	90,405	Reference of the study
Refined acceleration signal dataset	A_REF	78,189	Machine-related events
Refined sound pressure level signal dataset	S_REF	78,189	Machine-related events
Refined acceleration and sound pressure level signals dataset	AS_REF	78,189	Machine-related events
Median filtered acceleration signal dataset for training	A_MTRAIN	20,050	Removing impulses and train
Median filtered sound pressure level signal dataset for training	S_MTRAIN	20,050	Removing impulses and train
Median filtered acceleration and sound pressure level signals dataset for training	AS_MTRAIN	20,050	Removing impulses and train
Median filtered acceleration signal dataset for testing	A_MTEST	58,139	Removing impulses and test
Median filtered sound pressure level signal for testing	S_MTEST	58,139	Removing impulses and test
Median filtered acceleration and sound pressure level signals dataset for testing	AS_MTEST	58,139	Removing impulses and test

Table 2. Share of the true events in the signals used.

Signal Abbreviation	Number of Observations	Share of Events (%) in the Number of Observations
Signal Abbreviation	Number of Observations	Cut	Move	Pause
Refined (A, S, A and S)	78,189	20.29	13.40	66.31
Median filtered for training (A, S, A and S)	20,050	18.47	13.00	68.53
Median filtered for testing (A, S, A and S)	58,139	20.20	13.11	66.68

Table 3. Results of classification performance metrics following the training of the artificial neural network (ANN); area under the curve (AUC); classification accuracy (CA); Pause events. Precision (PREC); recall (REC); (F1) the harmonic mean of PREC and REC.

Input Signal	Training Time (s)	Event	Performance Metrics
Input Signal	Training Time (s)	Event	AUC	CA	F1	PREC	REC
AS_MTRAIN	350	Pause	0.938	0.871	0.910	0.873	0.951
		Move	0.888	0.884	0.400	0.608	0.299
		Cut	0.997	0.977	0.939	0.927	0.951
		Overall	0.944	0.866	0.849	0.848	0.866
A_MTRAIN	125	Pause	0.635	0.685	0.813	0.685	1.000
		Move	0.588	0.870	0.000	0.000	0.000
		Cut	0.629	0.815	0.001	1.000	0.000
		Overall	0.617	0.685	0.558	0.654	0.685
S_MTRAIN	175	Pause	0.932	0.860	0.903	0.862	0.947
		Move	0.880	0.878	0.362	0.570	0.265
		Cut	0.996	0.975	0.934	0.929	0.939
		Overall	0.939	0.857	0.838	0.837	0.857

Table 4. Number and share of correct classifications in the testing dataset.

Features	Number of Observations	Share in Correctly Classified	AUC	CA	F1	PREC	REC
Total correctly classified	49,366	100
Cut	11,330	22.95
Move	2577	5.22
Pause	35,459	71.83
Overall performance			0.939	0.849	0.838	0.832	0.849

Table 5. Number and share of misclassifications in the testing dataset.

Features	Number of Observations	Share in Misclassified
Total misclassified observations	8773	100
Cut misclassified as Pause	341	3.89
Cut misclassified as Move	75	0.85
Move misclassified as Cut	303	3.45
Move misclassified as Pause	4745	54.09
Pause misclassified as Cut	1016	11.58
Pause misclassified as Move	2293	26.14

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Cheţa, M.; Marcu, M.V.; Iordache, E.; Borz, S.A. Testing the Capability of Low-Cost Tools and Artificial Intelligence Techniques to Automatically Detect Operations Done by a Small-Sized Manually Driven Bandsaw. Forests 2020, 11, 739. https://doi.org/10.3390/f11070739

AMA Style

Cheţa M, Marcu MV, Iordache E, Borz SA. Testing the Capability of Low-Cost Tools and Artificial Intelligence Techniques to Automatically Detect Operations Done by a Small-Sized Manually Driven Bandsaw. Forests. 2020; 11(7):739. https://doi.org/10.3390/f11070739

Chicago/Turabian Style

Cheţa, Marius, Marina Viorela Marcu, Eugen Iordache, and Stelian Alexandru Borz. 2020. "Testing the Capability of Low-Cost Tools and Artificial Intelligence Techniques to Automatically Detect Operations Done by a Small-Sized Manually Driven Bandsaw" Forests 11, no. 7: 739. https://doi.org/10.3390/f11070739

APA Style

Cheţa, M., Marcu, M. V., Iordache, E., & Borz, S. A. (2020). Testing the Capability of Low-Cost Tools and Artificial Intelligence Techniques to Automatically Detect Operations Done by a Small-Sized Manually Driven Bandsaw. Forests, 11(7), 739. https://doi.org/10.3390/f11070739

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Testing the Capability of Low-Cost Tools and Artificial Intelligence Techniques to Automatically Detect Operations Done by a Small-Sized Manually Driven Bandsaw

Abstract

1. Introduction

2. Materials and Methods

2.1. Facility Description and Machine’s Functions

2.2. Data Collection and Processing

2.3. Setup of the Artificial Neural Network

3. Results

3.1. Descriptive Statistics of the Refined Signal Datasets

3.2. Training Results and Selection of the Model

3.3. Statistics and Classification Performance on the Test Signal Dataset

4. Discussion

5. Conclusions

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI