<!DOCTYPE article
PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD with MathML3 v1.2 20190208//EN" "JATS-archivearticle1-mathml3.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" article-type="research-article"><?properties manuscript?><front><journal-meta><journal-id journal-id-type="nlm-journal-id">101303912</journal-id><journal-id journal-id-type="pubmed-jr-id">50224</journal-id><journal-id journal-id-type="nlm-ta">Autom Constr</journal-id><journal-id journal-id-type="iso-abbrev">Autom Constr</journal-id><journal-title-group><journal-title>Automation in construction</journal-title></journal-title-group><issn pub-type="ppub">0926-5805</issn><issn pub-type="epub">1872-7891</issn></journal-meta><article-meta><article-id pub-id-type="pmid">33897107</article-id><article-id pub-id-type="pmc">8064735</article-id><article-id pub-id-type="doi">10.1016/j.autcon.2020.103322</article-id><article-id pub-id-type="manuscript">HHSPA1653730</article-id><article-categories><subj-group subj-group-type="heading"><subject>Article</subject></subj-group></article-categories><title-group><article-title>Fusing imperfect experimental data for risk assessment of musculoskeletal disorders in construction using canonical polyadic decomposition</article-title></title-group><contrib-group><contrib contrib-type="author"><name><surname>Dutta</surname><given-names>Amrita</given-names></name><xref ref-type="aff" rid="A1">a</xref></contrib><contrib contrib-type="author"><name><surname>Breloff</surname><given-names>Scott P.</given-names></name><xref ref-type="aff" rid="A2">b</xref></contrib><contrib contrib-type="author"><name><surname>Dai</surname><given-names>Fei</given-names></name><xref ref-type="aff" rid="A1">a</xref><xref rid="CR1" ref-type="corresp">*</xref></contrib><contrib contrib-type="author"><name><surname>Sinsel</surname><given-names>Erik W.</given-names></name><xref ref-type="aff" rid="A2">b</xref></contrib><contrib contrib-type="author"><name><surname>Carey</surname><given-names>Robert E.</given-names></name><xref ref-type="aff" rid="A2">b</xref></contrib><contrib contrib-type="author"><name><surname>Warren</surname><given-names>Christopher M.</given-names></name><xref ref-type="aff" rid="A2">b</xref></contrib><contrib contrib-type="author"><name><surname>Wu</surname><given-names>John Z.</given-names></name><xref ref-type="aff" rid="A2">b</xref></contrib></contrib-group><aff id="A1"><label>a</label>Department of Civil and Environmental Engineering, West Virginia University, P.O. Box 6103, Morgantown, WV 26506, United States of America</aff><aff id="A2"><label>b</label>National Institute for Occupational Safety and Health, 1095 Willowdale Road, Morgantown, WV 26505, United States of America</aff><author-notes><corresp id="CR1"><label>*</label>Corresponding author. <email>fei.dai@mail.wvu.edu</email> (F. Dai).</corresp></author-notes><pub-date pub-type="nihms-submitted"><day>17</day><month>12</month><year>2020</year></pub-date><pub-date pub-type="ppub"><month>11</month><year>2020</year></pub-date><pub-date pub-type="pmc-release"><day>23</day><month>4</month><year>2021</year></pub-date><volume>119</volume><elocation-id>10.1016/j.autcon.2020.103322</elocation-id><abstract id="ABS1"><p id="P1">Field or laboratory data collected for work-related musculoskeletal disorder (WMSD) risk assessment in construction often becomes unreliable as a large amount of data go missing due to technology-induced errors, instrument failures or sometimes at random. Missing data can adversely affect the assessment conclusions. This study proposes a method that applies Canonical Polyadic Decomposition (CPD) tensor decomposition to fuse multiple sparse risk-related datasets and fill in missing data by leveraging the correlation among multiple risk indicators within those datasets. Two knee WMSD risk-related datasets&#x02014;3D knee rotation (kinematics) and electromyography (EMG) of five knee postural muscles&#x02014;collected from previous studies were used for the validation and demonstration of the proposed method. The analysis results revealed that for a large portion of missing values (40%), the proposed method can generate a fused dataset that provides reliable risk assessment results highly consistent (70%&#x02013;87%) with those obtained from the original experimental datasets. This signified the usefulness of the proposed method for use in WMSD risk assessment studies when data collection is affected by a significant amount of missing data, which will facilitate reliable assessment of WMSD risks among construction workers. In the future, findings of this study will be implemented to explore whether, and to what extent, the fused dataset outperforms the datasets with missing values by comparing consistencies of the risk assessment results obtained from these datasets for further investigation of the fusion performance.</p></abstract><kwd-group><kwd>Tensor decomposition</kwd><kwd>Risk assessment</kwd><kwd>Data fusion</kwd><kwd>Construction safety</kwd></kwd-group></article-meta></front><body><sec id="S1"><label>1.</label><title>Introduction</title><p id="P2">Work-related musculoskeletal disorders (WMSDs) are one of the most common causes of days away from work and physical disabilities in the construction industry [<xref rid="R1" ref-type="bibr">1</xref>]. An increased exposure to risk factors in the workplace can enhance the likelihood of WMSDs; hence, proper identification of possible risk exposures and developing injury prevention strategies are essential to alleviate WMSDs.</p><p id="P3">Collecting risk exposure data with human subject involvement is most often accepted as the gold standard for understanding risky behaviors and conditions that may expose workers to WMSD risks on construction sites [<xref rid="R2" ref-type="bibr">2</xref>]. Generally, these data are collected in laboratory settings or real construction sites by technologies such as optical motion capture systems or surface electromyography sensors. However, data collected from these technologies often suffer from &#x02018;drop out&#x02019;, a phenomenon in which data is missing due to technology-induced errors (e.g., disconnection of sensors, errors in communicating with the database server, instrument failures), human-induced errors (e.g., accidental human omission) or other unknown reasons [<xref rid="R3" ref-type="bibr">3</xref>]. The result is incompleteness of the collected risk exposure data that may lead to invalid conclusions on the effects of the potential WMSD risk factors. Missing data is a common problem associated with data collection in ergonomic risk assessment using technologies, regardless of the quality of the research design [<xref rid="R4" ref-type="bibr">4</xref>]. Therefore, it should be carefully handled. In doing so, reserving the interrelation among the potential risk factors and the risk indicators across multiple datasets is necessary. Among several benefits of data fusion, one is revealing the latent pattern of the data and leveraging collaborative relationships among various factors within multiple datasets based on that pattern. This benefit can be utilized to reserve the interrelation among different factors and the risk indicators across the datasets to fill in the missing data. This study proposes a method for dealing with multiple imperfect and incomplete datasets by applying a Canonical Polyadic Decomposition (CPD) technique to treat the imperfect data for WMSD risk assessment. CPD decomposes the incomplete datasets based on the latent relationship among different risk factors and the risk indicators, then reconstructs a new dataset through fusion as a high-order tensor [<xref rid="R5" ref-type="bibr">5</xref>]. This newly reconstructed dataset is referred to as fused dataset, which can then be used for assessing the risk of WMSDs. To validate the effectiveness of the CPD-based method in assessing WMSDs, two WMSD risk-related datasets collected from prior experimental studies (original datasets) were intentionally modified to represent incomplete datasets. Then CPD was applied for fusion and to reconstruct the fused datasets. The risk assessment results obtained using the fused datasets were further compared to those obtained by using the original datasets to evaluate the performance of the fusion treatment.</p></sec><sec id="S2"><label>2.</label><title>Background</title><sec id="S3"><label>2.1.</label><title>Importance of research</title><p id="P4">Missing data is a common problem in research studies that involve human subjects and technologies for data collection. They can reduce statistical power of a study and lead to erroneous conclusions [<xref rid="R6" ref-type="bibr">6</xref>]. To potentially mitigate this issue, the sample size for data collection is typically increased. However, this is not always possible due to research design, limitations in budget and human resources. It is not always feasible to regenerate the data by repeating the experiment, as it can be costly and time-consuming. There are existing methods of handling missing data in literature such as simply omitting observations with the missing data [<xref rid="R7" ref-type="bibr">7</xref>], regression imputation [<xref rid="R8" ref-type="bibr">8</xref>], mean substitution [<xref rid="R9" ref-type="bibr">9</xref>], replacing the missing data with the last observed values [<xref rid="R10" ref-type="bibr">10</xref>], and estimating the missing data using conditional distribution of the other variables [<xref rid="R11" ref-type="bibr">11</xref>]. While useful, these methods may not be optimal for tackling data missing in WMSD risk assessment. Simply omitting observations with missing data typically decreases the sample size and may adversely affect the statistical power [<xref rid="R7" ref-type="bibr">7</xref>]. Regression imputation predicts a missing value from other variables, but it barely adds any new information except increasing the sample size and compromising the standard error [<xref rid="R8" ref-type="bibr">8</xref>]. If there is a great inequality in the proportion of missing values of different variables, mean substitution may lead to inconsistent biases and underestimation of errors [<xref rid="R9" ref-type="bibr">9</xref>]. Replacing missing data with the last observed values assumes that there will be no changes in the outcome [<xref rid="R10" ref-type="bibr">10</xref>]. Using conditional distribution of the other variables becomes inappropriate when a large amount of data is missing, as in this situation, the relationship among the variables needs to be properly computed [<xref rid="R7" ref-type="bibr">7</xref>,<xref rid="R11" ref-type="bibr">11</xref>].</p><p id="P5">Moreover, in real life scenarios, a phenomenon can be described and characterized with a set of factors. Each of these factors is referred to as the &#x02018;dimension&#x02019; of that phenomenon [<xref rid="R12" ref-type="bibr">12</xref>]. When a research problem includes multiple dimensions, it is considered &#x02018;multidimensional&#x02019;. This is often the case in WMSD risk studies, in which risk exposure data can be collected in terms of different measurements, often referred to as risk indicators, and working conditions for the purpose of in-depth understanding of risks. The working conditions may include various work settings and postures, and the risk indicator measurements may include kinematics, kinetics and electromyography (EMG) measurements. Quite often, multiple risk indicators can provide a more detailed insight into the worker&#x02019;s risk-inducing behavior, compared to what a single risk indicator can provide as these multiple risk indicators may be correlated. In WMSD risk studies that involves analyzing the effects of working conditions on risk indicators, the working conditions and the risk indicators can be termed as a &#x02018;dimension&#x02019; of the risk phenomenon. This type of data can be considered as high-dimensional data. When data is missing in multiple high-dimensional risk-related datasets, the existing methods will not be suitable as the interrelation among different risk indicators and potential risk factors across multiple datasets may not be well captured. Fuzzy modeling has been used in multidimensional missing data imputation [<xref rid="R13" ref-type="bibr">13</xref>]. However, for high-dimensional datasets where capturing the latent relationship among various dimensions is essential during imputation, the performance of Fuzzy algorithms degrades [<xref rid="R14" ref-type="bibr">14</xref>,<xref rid="R15" ref-type="bibr">15</xref>]. Besides, the performance of Fuzzy models in dealing with multiple datasets is yet to be tested. In contrast, data fusion can be an efficient method in this regard that integrates information based on the interrelation among the factors and risk indicators in such a way that it can produce a more complete representation of the measurements of the risk indicators, compared to that of an incomplete dataset [<xref rid="R16" ref-type="bibr">16</xref>].</p></sec><sec id="S4"><label>2.2.</label><title>State of research on data fusion in construction</title><p id="P6">In the construction industry, data fusion methods have been used for automated identification, location estimation, dislocation detection of construction materials in jobsites [<xref rid="R17" ref-type="bibr">17</xref>,<xref rid="R18" ref-type="bibr">18</xref>], automated progress tracking of construction projects [<xref rid="R19" ref-type="bibr">19</xref>], structural health monitoring [<xref rid="R20" ref-type="bibr">20</xref>], and damage identification of civil structures [<xref rid="R21" ref-type="bibr">21</xref>]. In construction ergonomics, a computationally efficient approach using data fusion was developed to recognize construction workers&#x02019; awkward postures [<xref rid="R22" ref-type="bibr">22</xref>]. Fusion of data from continuous remote monitoring of construction workers&#x02019; location and physiological status was used to identify safe and unsafe behaviors of construction workers [<xref rid="R23" ref-type="bibr">23</xref>]. Fusion of spatio-temporal and workers&#x02019; thoracic posture data has been done for understanding the worker&#x02019;s activity type for productivity assessment [<xref rid="R24" ref-type="bibr">24</xref>]. A position and posture data fusion method was proposed for evaluation of construction workers&#x02019; behavioral risks [<xref rid="R25" ref-type="bibr">25</xref>]. However, no study has been done that uses data fusion of incomplete risk-related datasets for assessing WMSD risks.</p></sec><sec id="S5"><label>2.3.</label><title>State of research on tensor decomposition for data fusion</title><p id="P7">To integrate multidimensional data, it is essential to learn the relationships between those dimensions to understand the multidimensional nature of a phenomenon [<xref rid="R12" ref-type="bibr">12</xref>]. There are currently several existing approaches for analyzing multiple datasets. Zheng et al. [<xref rid="R26" ref-type="bibr">26</xref>] summarized the methods of fusing multiple datasets into three categories. The first category is stage-based fusion methods, where different datasets are used at different stages of data mining tasks [<xref rid="R27" ref-type="bibr">27</xref>]. The second category is deep learning-based fusion methods that use a neural network to extract original features from multiple datasets and learn a new representation of those features for classification and prediction purposes [<xref rid="R28" ref-type="bibr">28</xref>]. The third category is semantic meaning-based fusion methods that identify the association between different features across multiple datasets and carry the semantic meaning, i.e., the latent relationship between the features during fusion [<xref rid="R26" ref-type="bibr">26</xref>]. Similarity-based data fusion is one type of the semantic meaning-based fusion methods, in which similarity between the features are measured from multiple datasets. These similarities can leverage the correlation among the features collectively and help obtain any missing information of a feature based on the information available for another feature. Based on the similarities, different datasets can be fused [<xref rid="R26" ref-type="bibr">26</xref>]. This study is interested in fusion of risk-related datasets containing different features that represent WMSD risk factors in construction and are inherently related to each other. Understanding the latent relationship between different features across multiple datasets plays a vital role in proper fusion. Also, learning the similarities between those features is useful to impute the missing values. As a result, the similarity-based data fusion methods lend themselves well to tackling the fusion problem in this study.</p><p id="P8">Tensor decomposition is one type of similarity-based data fusion methods [<xref rid="R29" ref-type="bibr">29</xref>]. Tensors are generalizations of matrices to higher dimensions, and are powerful to model multidimensional data [<xref rid="R30" ref-type="bibr">30</xref>]. Tensors are multidimensional arrays of numerical values that are widely used in different applications in data analysis and machine learning, including filling in missing values [<xref rid="R31" ref-type="bibr">31</xref>], anomaly detection [<xref rid="R32" ref-type="bibr">32</xref>], object profiling [<xref rid="R33" ref-type="bibr">33</xref>], discovering patterns [<xref rid="R34" ref-type="bibr">34</xref>], and predicting evolution [<xref rid="R35" ref-type="bibr">35</xref>]. Matrix factorization was previously used in filling out missing data [<xref rid="R36" ref-type="bibr">36</xref>]; however, it only works with two-dimensional data and may not be applicable to high-dimensional data like those used in WMSD risk studies. Tensor decomposition is a useful tool to deal with high-dimensional datasets. It accurately extracts the correlations among various dimensions from different datasets and learns the latent structures and collaborative relationships among the dimensions to approximate the pattern of the data [<xref rid="R37" ref-type="bibr">37</xref>].</p><p id="P9">With increases in dimensions of the datasets, the number of the elements in a tensor also increases. This increase in elements makes the dataset difficult to deal with in terms of computational and memory requirements. By decomposing the high-dimensional data presented in form of a tensor, the problems associated with high-dimensionality can be alleviated or even removed [<xref rid="R38" ref-type="bibr">38</xref>]. Also, tensor decomposition can be used as an efficient tool for missing value prediction [<xref rid="R39" ref-type="bibr">39</xref>,<xref rid="R40" ref-type="bibr">40</xref>]. Though tensor decomposition has been abundantly applied in areas such as scientific computing, signal processing, and social media data analysis, its use in construction industry-related datasets is limited. One application of tensor decomposition with construction datasets was concerned with awkward posture recognition in which worker&#x02019;s motion data was presented as high order tensor [<xref rid="R22" ref-type="bibr">22</xref>]. However, this technique has yet to be established as an alternative method of treating imperfect experimental data to provide meaningful and accurate risk assessment results in general WMSD risk assessment studies.</p><p id="P10">The two most widely used tensor decomposition algorithms are <italic>canonical polyadic decomposition</italic> (CPD) and <italic>Tucker decomposition</italic> (TD). CPD is generally advised for use with latent parameter estimation, while the TD is suggested in use of subspace estimation, compression, and dimensionality reduction [<xref rid="R5" ref-type="bibr">5</xref>]. As the objective of this study is to fuse two incomplete datasets based on the latent correlations among all dimensions of those datasets, CPD is more suitable than TD in fulfilling the purpose of this study. Moreover, CPD is very effective in capturing the interactions among various dimensions of a high-dimensional datasets and therefore, it can effectively be used for imputing missing data [<xref rid="R31" ref-type="bibr">31</xref>]. Considering many other tensor decomposition techniques are based on CPD and TD and CPD is more suitable for this study, the following subsection will only discuss CPD for tensor decomposition.</p><sec id="S6"><label>2.3.1.</label><title>Canonical polyadic decomposition (CPD)</title><p id="P11">In order to impute missing data, CPD extrapolates the latent structures and collaborative relationships among different dimensions, such as rows and columns of a tensor. The CPD method factorizes a tensor into a sum of component rank-one tensors [<xref rid="R37" ref-type="bibr">37</xref>]. For example, in CPD, a given three-dimensional tensor <italic><bold>Y</bold></italic> &#x02208; <italic>&#x0211d;</italic><sup><italic>I</italic>&#x000d7;<italic>J</italic>&#x000d7;<italic>K</italic></sup> can be written as:
<disp-formula id="FD1"><label>(1)</label><mml:math display="block" id="M1"><mml:mrow><mml:mi mathvariant="bold-italic">Y</mml:mi><mml:mo>&#x02248;</mml:mo><mml:munderover><mml:mo>&#x02211;</mml:mo><mml:mrow><mml:mi>r</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mi>R</mml:mi></mml:munderover><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">a</mml:mi><mml:mi>r</mml:mi></mml:msub><mml:mo>&#x02218;</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">b</mml:mi><mml:mi>r</mml:mi></mml:msub><mml:mo>&#x02218;</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">c</mml:mi><mml:mi>r</mml:mi></mml:msub></mml:mrow><mml:mo>&#x02261;</mml:mo><mml:mrow><mml:mo>&#x0301a;</mml:mo><mml:mrow><mml:mi mathvariant="bold-italic">A</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="bold-italic">B</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="bold-italic">C</mml:mi></mml:mrow><mml:mo>&#x0301b;</mml:mo></mml:mrow></mml:mrow></mml:math></disp-formula>
where <italic>R</italic> is a positive integer, which is referred to as the rank of a tensor. The rank of tensor <italic><bold>Y</bold></italic> = <italic>R</italic> can be defined as the smallest number of rank-1 tensor that is required to represent the tensor <italic><bold>Y</bold></italic> as their sum [<xref rid="R5" ref-type="bibr">5</xref>] (<xref rid="F1" ref-type="fig">Fig. 1</xref>); Rank-1 tensor is defined as a decomposition of an <italic>N</italic>-dimensional tensor into one outer product of <italic>N</italic> vectors. <italic>R</italic> is also called latent factor. <italic>I, J</italic> and <italic>K</italic> are the sizes of the dimensions of the tensor <italic><bold>Y</bold></italic> where, <italic><bold>a</bold><sub>r</sub></italic> &#x02208; <italic>&#x0211d;<sup>I</sup></italic>, <italic><bold>b</bold><sub>r</sub></italic> &#x02208; <italic>&#x0211d;<sup>J</sup></italic> and <italic><bold>c</bold></italic><sub><italic>r</italic></sub> &#x02208; <italic>&#x0211d;</italic><sup><italic>K</italic></sup> for <italic>r</italic> = <italic>1</italic>&#x02026;<italic>R</italic>. &#x02018;&#x02218;&#x02019; indicates the vector outer product. <italic><bold>a</bold></italic><sub><italic>r</italic></sub> &#x02218; <italic><bold>b</bold></italic><sub><italic>r</italic></sub> &#x02218; <italic><bold>c</bold></italic><sub><italic>r</italic></sub> &#x02208;<italic>&#x0211d;</italic><sup><italic>I</italic>&#x000d7;<italic>J</italic>&#x000d7;<italic>K</italic></sup> is an outer product of three vectors <italic><bold>a</bold></italic><sub><italic>r</italic></sub>, <italic><bold>b</bold></italic><sub><italic>r</italic></sub> and <bold><italic>c</italic></bold><sub><italic>r</italic></sub> and is referred to as a rank-1 tensor. <italic><bold>A, B</bold></italic> and <bold><italic>C</italic></bold> are the factor matrices obtained for each dimension after decomposition of the tensor <italic><bold>Y</bold></italic>, and are referred to as the combination of the vectors from the rank-1 components. Here, the factors matrices <italic><bold>A</bold></italic> = [<italic><bold>a</bold></italic><sub>1</sub>, <italic><bold>a</bold></italic><sub>2</sub>, &#x02026;, <italic><bold>a</bold><sub>R</sub></italic>] &#x02208;<italic>&#x0211d;</italic><sup><italic>I</italic>&#x000d7;<italic>R</italic></sup>, <italic><bold>B</bold></italic> = [<italic><bold>b</bold></italic><sub>1</sub>, <italic><bold>b</bold></italic><sub>2</sub>, &#x02026;, <italic><bold>b</bold></italic><sub>R</sub>] &#x02208;<italic>&#x0211d;</italic><sup><italic>J</italic>&#x000d7;<italic>R</italic></sup> and <italic><bold>C</bold></italic> = [<italic><bold>c</bold></italic><sub>1</sub>, <italic><bold>c</bold></italic><sub>2</sub>, &#x02026;, <italic><bold>c</bold></italic><sub>R</sub>] &#x02208;<italic>&#x0211d;</italic><sup><italic>K</italic>&#x000d7;<italic>R</italic></sup>. Their column vectors are expressed explicitly as: <bold><italic>a</italic></bold><sub><italic>r</italic></sub> = [<italic>a</italic><sub>1<italic>r</italic></sub>, <italic>a</italic><sub>2<italic>r</italic></sub>, <italic>a</italic><sub>3<italic>r</italic></sub>, &#x02026;, <italic>a</italic><sub><italic>Ir</italic></sub>]<sup><italic>T</italic></sup>, <italic><bold>b</bold></italic><sub><italic>r</italic></sub> = [<italic>b</italic><sub>1<italic>r</italic></sub>, <italic>b</italic><sub>2<italic>r</italic></sub>, <italic>b</italic><sub>3<italic>r</italic></sub>, &#x02026;, <italic>b</italic><sub><italic>Jr</italic></sub>]<sup><italic>T</italic></sup>, <italic><bold>c</bold></italic><sub><italic>r</italic></sub> = [<italic>c</italic><sub>1<italic>r</italic></sub>, <italic>c</italic><sub>2<italic>r</italic></sub>, <italic>c</italic><sub>3<italic>r</italic></sub>, &#x02026;, <italic>c</italic><sub><italic>Kr</italic></sub>]<sup><italic>T</italic></sup>.</p><p id="P13">Elementwise, <xref rid="FD1" ref-type="disp-formula">Eq. (1)</xref> can be expressed as:
<disp-formula id="FD2"><label>(2)</label><mml:math display="block" id="M2"><mml:mrow><mml:msub><mml:mi>Y</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>&#x02248;</mml:mo><mml:munderover><mml:mo>&#x02211;</mml:mo><mml:mrow><mml:mi>r</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mi>R</mml:mi></mml:munderover><mml:mrow><mml:msub><mml:mi>a</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>r</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mi>b</mml:mi><mml:mrow><mml:mi>j</mml:mi><mml:mi>r</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mi>c</mml:mi><mml:mrow><mml:mi>k</mml:mi><mml:mi>r</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mspace width="thickmathspace"/><mml:mtext>for</mml:mtext><mml:mspace width="thickmathspace"/><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo>&#x02026;</mml:mo><mml:mi>I</mml:mi><mml:mo>;</mml:mo><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo>&#x02026;</mml:mo><mml:mi>J</mml:mi><mml:mo>;</mml:mo><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo>&#x02026;</mml:mo><mml:mi>K</mml:mi><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula></p><p id="P14">CPD of a 3-dimensional tensor can be formalized as follows:
<disp-formula id="FD3"><label>(3)</label><mml:math display="block" id="M3"><mml:mrow><mml:munder><mml:mrow><mml:mi>m</mml:mi><mml:mi>i</mml:mi><mml:mi>n</mml:mi></mml:mrow><mml:mover accent="true"><mml:mi mathvariant="bold-italic">Y</mml:mi><mml:mo>^</mml:mo></mml:mover></mml:munder><mml:mrow><mml:mo>&#x02016;</mml:mo><mml:mrow><mml:mi mathvariant="bold-italic">Y</mml:mi><mml:mo>&#x02212;</mml:mo><mml:mover accent="true"><mml:mi mathvariant="bold-italic">Y</mml:mi><mml:mo>^</mml:mo></mml:mover></mml:mrow><mml:mo>&#x02016;</mml:mo></mml:mrow><mml:mo>;</mml:mo><mml:mtext>Where</mml:mtext><mml:mo>,</mml:mo><mml:mover accent="true"><mml:mi mathvariant="bold-italic">Y</mml:mi><mml:mo>^</mml:mo></mml:mover><mml:mo>=</mml:mo><mml:munderover><mml:mo>&#x02211;</mml:mo><mml:mrow><mml:mi>r</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mi>R</mml:mi></mml:munderover><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">a</mml:mi><mml:mi>r</mml:mi></mml:msub><mml:mo>&#x02218;</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">b</mml:mi><mml:mi>r</mml:mi></mml:msub><mml:mo>&#x02218;</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">c</mml:mi><mml:mi>r</mml:mi></mml:msub></mml:mrow><mml:mo>&#x02261;</mml:mo><mml:mrow><mml:mo>&#x0301a;</mml:mo><mml:mrow><mml:mi mathvariant="bold-italic">A</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="bold-italic">B</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="bold-italic">C</mml:mi></mml:mrow><mml:mo>&#x0301b;</mml:mo></mml:mrow></mml:mrow></mml:math></disp-formula>
where <italic><bold>Y</bold></italic> is the original tensor and <italic><bold>&#x00176;</bold></italic> is a low rank approximation of tensor <italic><bold>Y</bold></italic>. CPD of a tensor is typically computed using the alternating least square (ALS) algorithm where each factor matrix is iteratively solved using a least square method. This algorithm fixes all factor matrices, except one, to optimize for the non-fixed matrix and then repeats this procedure for each matrix repeatedly until it converges [<xref rid="R37" ref-type="bibr">37</xref>,<xref rid="R41" ref-type="bibr">41</xref>].</p><p id="P16">The CPD of a three-dimensional tensor is illustrated in <xref rid="F1" ref-type="fig">Fig. 1</xref>.</p></sec></sec></sec><sec id="S7"><label>3.</label><title>Problem statement and research objective</title><p id="P17">For assessing WMSD risks among construction workers, human-based data can potentially suffer from missing data points due to dropout from the data collection technology. However, an in-depth method in handling multiple imperfect datasets for assessing WMSD risks is missing in the existing literature. Tensor decomposition-based data fusion can be potentially useful in this regard. It may help understand the data distribution of each dataset and consider the correlation among risk indicators captured at multiple work settings or conditions to fill in the missing data points. Therefore, the objective of this research is to develop a method that applies CPD tensor decomposition techniques to fuse multiple imperfect datasets and replace missing data by considering the correlation among the risk indicators for assessing the WMSD risks among construction workers.</p></sec><sec id="S8"><label>4.</label><title>Proposed method</title><p id="P18"><xref rid="F2" ref-type="fig">Fig. 2</xref> provides a schematic overview of the proposed method. First, multiple risk-related incomplete datasets captured in multiple experimental settings are represented as high-dimensional tensors. Then these tensors are fused by applying the CPD tensor decomposition. The CPD first decomposes these tensors into factor matrices that represent the latent structures and collaborative relationships among all dimensions. Based on the correlation among different dimensions, the CPD then reconstructs a new dataset with all missing values filled. This fused new dataset is ready to be used for subsequent risk assessments. In the following subsections, the steps of the proposed method are described in detail.</p><sec id="S9"><label>4.1.</label><title>Incomplete datasets for WMSD risk assessment</title><p id="P19">For WMSD risk assessment in construction, the effects of various work-related factors on specific risk indicators are often measured. Therefore, a typical WMSD risk-related dataset contains information such as work settings/experimental conditions and specific risk indicators. Often, a WMSD risk may not be characterized with a single type of risk indicator and in that case, multiple types of WMSD risk indicators are measured, e.g., kinematics (flexion, abduction/adduction, internal/external rotation), muscle activity data recorded by EMG, and kinetic data recorded by force plates, and can be available in multiple datasets.</p><p id="P20">To describe the proposed method, consider two datasets DS1 and DS2 (denoted for <italic>Incomplete Dataset 1</italic> and <italic>Incomplete Dataset 2</italic> in <xref rid="F2" ref-type="fig">Fig. 2</xref>). Each dataset contains two different types of WMSD risk indicators collected from several subjects, investigating various work-induced risks for a number of work settings (e.g., working technique, working posture, working condition). This is common practice of WMSD risk assessment studies in construction, where different types of risk indicators are often measured from the same group of people and the same work settings for an in-depth understanding of a specific type of risk. In this case, a data value in DS1 or DS2 represents a value of a risk indicator measured from a subject at certain work settings. For example, to measure the knee WMSD risk, two common types of risk indicators are measured &#x02013; awkward knee rotations and maximum EMG of knee postural muscles. Awkward knee rotations may include flexion, abduction/adduction, and internal/external rotations. Maximum EMG of knee postural muscles may include measurements of maximum flexor and extensor muscle activations. The datasets can be represented in a tabular form as shown in <xref rid="T1" ref-type="table">Table 1</xref>.</p><p id="P21">In <xref rid="T1" ref-type="table">Table 1</xref>, each dataset contains a type of risk indicator measurements collected for several subjects at a number of work settings. The column <italic>subject</italic> contains subjects&#x02019; IDs. The columns <italic>setting</italic><sub>1</sub>, <italic>setting</italic><sub>2</sub>, &#x02026;, <italic>setting<sub>q</sub></italic> contain arrangements of <italic>q</italic> work settings at which measurements of <italic>m</italic> risk indicators in columns <italic>ind</italic><sub>1</sub><sup>1</sup>, <italic>ind</italic><sub>1</sub><sup>2</sup>, &#x02026;, <italic>ind</italic><sub>1</sub><sup><italic>m</italic></sup> are collected in DS1, and <italic>n</italic> risk indicators are collected in columns <italic>ind</italic><sub>2</sub><sup>1</sup>, <italic>ind</italic><sub>2</sub><sup>2</sup>, &#x02026;, <italic>ind</italic><sub>2</sub><sup><italic>n</italic></sup> in DS2. For example, the column <italic>setting</italic><sub>1</sub> can represent working postures that have two arrangements &#x02013; static and dynamic. The column <italic>setting</italic><sub>2</sub> can represent slopes of the working site that have three arrangements &#x02013; 0&#x000b0;, 15&#x000b0; and 30&#x000b0;. So, in the column <italic>setting</italic><sub>1</sub>, the values are either &#x02018;static&#x02019; or &#x02018;dynamic&#x02019; and in the column <italic>setting</italic><sub>2</sub> the values are either &#x02018;0&#x000b0;&#x02019;, &#x02018;15&#x000b0;&#x02019; or &#x02018;30&#x000b0;&#x02019;. If a study involves only these two settings, then the value of <italic>q</italic> will be 2. Similarly, the columns <italic>ind</italic><sub>1</sub><sup>1</sup>, <italic>ind</italic><sub>1</sub><sup>2</sup>, &#x02026;, <italic>ind</italic><sub>1</sub><sup><italic>m</italic></sup> can contain kinematics measures (flexion, abduction/adduction, internal/external rotation) and <italic>ind</italic><sub>2</sub><sup>1</sup>, <italic>ind</italic><sub>2</sub><sup>2</sup>, &#x02026;, <italic>ind</italic><sub>2</sub><sup><italic>n</italic></sup> can contain EMG measures of several muscles. To be more specific with an example, in DS1, a value of any of the <italic>m</italic> risk indicator columns can be a certain measurement value of a risk indicator (e.g., flexion) captured for certain subject at certain arrangements of <italic>q</italic> work settings. Note that in both datasets, the values of columns of <italic>subject</italic> and <italic>setting</italic><sub>1</sub>, <italic>setting</italic><sub>2</sub> &#x02026;, <italic>setting<sub>q</sub></italic> are identical; the only difference is the measurements of the risk indicators. These datasets become incomplete when a significant amount of risk indicator measurements are not captured during data collection process.</p></sec><sec id="S10"><label>4.2.</label><title>Tensor construction</title><p id="P22">Since the datasets contain multiple risk indicator measurements for the same combination of subjects and settings, the risk indicator measurements from both datasets are concatenated, yielding a new dataset (DS3) as illustrated in <xref rid="T2" ref-type="table">Table 2</xref>. This process is called early integration data fusion [<xref rid="R36" ref-type="bibr">36</xref>].</p><p id="P23">In <xref rid="T2" ref-type="table">Table 2</xref>, each risk indicator measurement in DS3 is collected for certain subject at different arrangements of <italic>q</italic> work settings. Thus, &#x02018;subject&#x02019; and each &#x02018;work setting&#x02019; are considered as dimensions of each risk indicator, resulting in a total of (<italic>1</italic> + <sub>q</sub>) dimensions. For multiple risk indicators, this method allows datasets to be represented as a &#x02018;multi-dimensional&#x02019; or &#x02018;multi-modal&#x02019; tensor, where each cell of the tensor contains a value of a certain risk indicator, measured for a subject at certain arrangements of <italic>q</italic> work settings. In this study, the &#x02018;subject&#x02019;, each &#x02018;work setting&#x02019;, and the entire set of &#x02018;risk indicators&#x02019; can be considered as dimensions of the tensor which has a total of (<italic>1</italic> + <italic>q</italic> + <italic>1</italic>) dimensions.</p><p id="P24">To illustrate this tensor representation with a simplified example, consider only one work setting (e.g., working postures) denoted by &#x02018;<italic>setting</italic>&#x02019;. In this case, DS3 can be represented as a three-dimensional tensor (<italic>subject</italic> &#x000d7; <italic>risk indicators</italic> &#x000d7; <italic>setting</italic>) (<italic><bold>M</bold></italic><sub><italic>i, j, k</italic></sub>) &#x02208; <italic>&#x0211d;</italic><sup><italic>s</italic>&#x000d7;<italic>r</italic>&#x000d7;<italic>t</italic></sup> (<xref rid="F3" ref-type="fig">Fig. 3</xref>), where <italic>i, j</italic> and <italic>k</italic> correspond to dimensions &#x02018;<italic>subject</italic>&#x02019;, &#x02018;<italic>risk indicators</italic>&#x02019; and &#x02018;<italic>setting</italic>&#x02019; respectively; and <italic>s, r</italic> and <italic>t</italic> are their sizes where <italic>i</italic>&#x02208;{1,2, &#x02026;<italic>s</italic>}, <italic>j</italic>&#x02208;{1,2, &#x02026;<italic>r</italic>} and <italic>k</italic>&#x02208;{1,2, &#x02026;<italic>t</italic>}. Each slice along the <italic>setting</italic> axis in <xref rid="F3" ref-type="fig">Fig. 3</xref> represents a matrix (<italic><bold>M</bold></italic><sub><italic>i, j</italic></sub>) &#x02208; <italic>&#x0211d;</italic><sup><italic>s</italic>&#x000d7;<italic>r</italic></sup> given a setting arrangement <italic>k</italic>.</p></sec><sec id="S11"><label>4.3.</label><title>Tensor decomposition</title><p id="P25">Following tensor construction, CPD is applied to decompose the tensor into factor matrices. As mentioned earlier, CPD learns the latent structures and collaborative relationships among the dimensions of a tensor. Thus, each factor matrix represents the latent relationships between the corresponding dimension and all the other dimensions captured by latent factor <italic>R</italic> (previously discussed in <xref rid="S6" ref-type="sec">Section 2.3.1</xref>). Before applying CPD, the latent factor <italic>R</italic> should be computed properly to ensure unique mapping of the original tensor to the decomposed tensor in the form of factor matrices. In the above example, (<italic><bold>M</bold></italic><sub><italic>i, j, k</italic></sub>) &#x02208; <italic>&#x0211d;</italic><sup><italic>s</italic>&#x000d7;<italic>r</italic>&#x000d7;<italic>t</italic></sup> is associated with three dimensions &#x02013; &#x02018;<italic>subject</italic>&#x02019;, &#x02018;<italic>risk indicators</italic>&#x02019; and &#x02018;<italic>setting</italic>&#x02019;; hence, three factor matrices are obtained after decomposition, denoted as <italic><bold>FM</bold></italic><sub><bold>1</bold></sub> &#x02208; <italic>&#x0211d;</italic><sup><italic>s</italic>&#x000d7;<italic>R</italic></sup>, <italic><bold>FM</bold></italic><sub><bold>2</bold></sub> &#x02208; <italic>&#x0211d;</italic><sup><italic>r</italic>&#x000d7;<italic>R</italic></sup>, <italic><bold>FM</bold></italic><sub><bold>3</bold></sub> &#x02208; <italic>&#x0211d;</italic><sup><italic>t</italic>&#x000d7;<italic>R</italic></sup>, as shown in <xref rid="F4" ref-type="fig">Fig. 4</xref>. Here, the factor matrix <italic><bold>FM</bold></italic><sub><bold>1</bold></sub> obtained for the dimension &#x02018;<italic>subject</italic>&#x02019; represents the latent relationships between the subject and the risk indictors collected at different arrangements of the setting. <italic><bold>FM</bold></italic><sub><bold>2</bold></sub> is obtained for the dimension &#x02018;<italic>risk indicators</italic>&#x02019; and represents the latent relationships between the risk indicators and the subjects at different arrangements of the setting for which those risk indicators were collected. <italic><bold>FM</bold></italic><sub><bold>3</bold></sub> is obtained for the dimension &#x02018;<italic>setting</italic>&#x02019; and represents the latent relationships between different arrangements of the setting and the risk indicators collected for the subjects at those arrangements of the setting. CPD leverages all the inter-dimensional correlations among all the dimensions of a tensor and performs data fusion based on those correlations to create a reconstructed tensor. The structures of the factor matrices and the method of computing <italic>R</italic> will be discussed further in &#x0201c;<xref rid="S13" ref-type="sec">Implementation and results</xref>&#x0201d;.</p></sec><sec id="S12"><label>4.4.</label><title>Tensor reconstruction</title><p id="P26">After tensor decomposition, the resulting factor matrices are used to reconstruct a new fused tensor where the missing values are filled and values of the WMSD risk indicators are fused based on the inter-dimensional correlations. In the prior example, such tensor can be denoted as <inline-formula><mml:math display="inline" id="M4"><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mstyle mathvariant="bold" mathsize="normal"><mml:mi mathvariant="bold-italic">M</mml:mi></mml:mstyle><mml:msub><mml:mo>&#x02032;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo>)</mml:mo></mml:mrow><mml:mo>&#x02208;</mml:mo><mml:msup><mml:mi>&#x0211d;</mml:mi><mml:mrow><mml:mi>s</mml:mi><mml:mo>&#x000d7;</mml:mo><mml:mi>r</mml:mi><mml:mo>&#x000d7;</mml:mo><mml:mi>t</mml:mi></mml:mrow></mml:msup><mml:mo>,</mml:mo></mml:mrow></mml:math></inline-formula> which is computed using the following equation:
<disp-formula id="FD4"><label>(4)</label><mml:math display="block" id="M5"><mml:mrow><mml:mstyle mathvariant="bold" mathsize="normal"><mml:mi mathvariant="bold-italic">M</mml:mi></mml:mstyle><mml:msub><mml:mo>&#x02032;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:munderover><mml:mo>&#x02211;</mml:mo><mml:mrow><mml:mi>r</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mi>R</mml:mi></mml:munderover><mml:mrow><mml:mi mathvariant="bold-italic">F</mml:mi><mml:msub><mml:mi mathvariant="bold-italic">M</mml:mi><mml:mrow><mml:mn>1</mml:mn><mml:mi>r</mml:mi></mml:mrow></mml:msub><mml:mo>&#x02218;</mml:mo><mml:mi mathvariant="bold-italic">F</mml:mi><mml:msub><mml:mi mathvariant="bold-italic">M</mml:mi><mml:mrow><mml:mn>2</mml:mn><mml:mi>r</mml:mi></mml:mrow></mml:msub><mml:mo>&#x02218;</mml:mo><mml:mi mathvariant="bold-italic">F</mml:mi><mml:msub><mml:mi mathvariant="bold-italic">M</mml:mi><mml:mrow><mml:mn>3</mml:mn><mml:mi>r</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mrow></mml:math></disp-formula>
where <italic><bold>FM</bold></italic><sub><bold>1</bold><italic>r</italic></sub>, <italic><bold>FM</bold></italic><sub><bold>2</bold><italic>r</italic></sub>, <italic><bold>FM</bold></italic><sub><bold>3</bold><italic>r</italic></sub> are the column vectors of <italic><bold>FM</bold></italic><sub>1</sub>, <italic><bold>FM</bold></italic><sub>2</sub> and <italic><bold>FM</bold></italic><sub>3</sub> respectively, <italic>R</italic> is the latent factor and &#x02018;&#x02218;&#x02019; is the vector outer product. Using the newly reconstructed tensor, values of the WMSD risk indicators for different subjects at different settings are extracted and reorganized into a tabular form, readily available for risk assessment.</p></sec></sec><sec id="S13"><label>5.</label><title>Implementation and results</title><sec id="S14"><label>5.1.</label><title>Original datasets collected from previous experiments</title><p id="P28">The current study considered two risk-related datasets that were collected from the authors&#x02019; prior human subject laboratory experimental studies. These studies assessed work-related factors for knee WMSDs among residential construction roofers who work on sloped environments. One dataset contains calculated knee rotation (kinematics) data, representing five knee rotational angles &#x02013; flexion, abduction, adduction, internal and external rotation [<xref rid="R42" ref-type="bibr">42</xref>]. The second data set contains EMG data, describing muscle activation of five knee postural muscles &#x02013; biceps femoris, recus femoris, semitendinosus, vastus lateralis and vastus medialis [<xref rid="R43" ref-type="bibr">43</xref>]. Awkward knee rotations and maximum muscle activation have been identified as two quantitative risk indicators of WMSDs in low extremities [<xref rid="R44" ref-type="bibr">44</xref>,<xref rid="R45" ref-type="bibr">45</xref>]. The five knee rotation angles represent the WMSD risk associated with awkward and extreme kneeling postures, while the EMG data represent the WMSD risk associated with heightened activation of knee postural muscles that might cause knee joint overloading. Both types of data were collected from the experimental study in which 9 subjects simulated the shingle installation roofing task at three roof slopes (0&#x000b0;, 15&#x000b0; and 30&#x000b0;) with two kneeling postures (static and dynamic). Each participant performed the task for five trials. Although the experiment was designed and implemented carefully, both datasets initially collected from the experiment had missing values. About 2% data were missing in the knee rotation dataset and about 20% data were missing in the EMG dataset at random. In this study, observations with missing data were intentionally removed from the initial datasets for the sake of obtaining complete datasets for evaluation. In the knee rotation dataset, each data point represents the maximum angle value of a knee rotation for a certain subject at a specific slope, posture and trial. In the EMG dataset, each data point represents the maximum normalized EMG value of a knee postural muscle of a certain subject at a specific slope, posture and trial. Since only maximum measurements in both datasets were collected as risk indicators, the two datasets could be easily synchronized.</p></sec><sec id="S15"><label>5.2.</label><title>Imperfect (or incomplete) datasets construction (datasets with missing values)</title><p id="P29">To evaluate the performance of the proposed method in dealing with missing data, this study was designed to fuse knee rotation and EMG datasets for different proportions of missing values using the CPD method. A proportion of data was randomly removed from both knee kinematics and EMG datasets with percentages of missing data as shown in <xref rid="T3" ref-type="table">Table 3</xref>. Therefore, fourteen incomplete datasets&#x02014;seven for knee rotations and seven for EMG&#x02014;were constructed with different portions of missing values.</p></sec><sec id="S16"><label>5.3.</label><title>Tensor construction</title><p id="P30">The incomplete rotation and EMG datasets were then used for tensor construction. As the knee rotation and EMG data were collected from the same subjects at the same work settings (i.e., roof slope, working posture, and trial), they could be combined by the early integration method [<xref rid="R46" ref-type="bibr">46</xref>]. In this method, the columns of these two types of risk indicators were concatenated to form a single set of length ten vectors which was referred as &#x02018;risk indicators&#x02019; in this study. For tensor construction, the following forty-nine incomplete datasets as shown in <xref rid="T4" ref-type="table">Table 4</xref> were considered.</p><p id="P31">An <italic>N</italic> dimensional tensor <italic><bold>T</bold></italic><sub><italic>N</italic></sub> &#x02208;<italic>&#x0211d;</italic><sup><italic>I</italic><sub>1</sub>&#x000d7;<italic>I</italic><sub>2</sub>&#x000d7;&#x02026;&#x02026;. &#x000d7;<italic>I<sub>N</sub></italic></sup> has size (<italic><bold>T</bold><sub>N</sub></italic>) = <italic>I</italic><sub>1</sub> &#x000d7; <italic>I</italic><sub>2</sub> &#x000d7; &#x02026;&#x02026;&#x02026; &#x000d7; <italic>I</italic><sub><italic>N</italic></sub>, where, <italic>I</italic><sub><italic>i</italic></sub> is the size of its <italic>i<sup>th</sup></italic> dimension [<xref rid="R31" ref-type="bibr">31</xref>]. As the risk indicators were measured from multiple subjects at different working postures, roof slopes and multiple trials, the collected data could be represented as a five-dimensional sparse tensor <italic><bold>T</bold></italic> &#x02208; <italic>&#x0211d;</italic><sup><italic>I</italic><sub>1</sub>&#x000d7;<italic>I</italic><sub>2</sub>&#x000d7;<italic>I</italic><sub>3</sub>&#x000d7;<italic>I</italic><sub>4</sub>&#x000d7;<italic>I</italic><sub>5</sub></sup>, where <italic>I</italic><sub><italic>1</italic></sub> = No. of subjects (total 9), <italic>I</italic><sub><italic>2</italic></sub> = No. of risk indicators [EMG and kinematics; total 10 (5 knee rotation angles and 5 knee muscle EMG)], <italic>I</italic><sub><italic>3</italic></sub> = No. of trials (total 5), <italic>I</italic><sub><italic>4</italic></sub> = No. of postures [total 2 (static and dynamic)], <italic>I</italic><sub><italic>5</italic></sub> = No. of slopes [total 3 (0&#x000b0;,15&#x000b0;,30&#x000b0;)]. In short, the tensor <italic><bold>T</bold></italic> maps the knee rotations and EMG measurements of nine subjects, obtained from five trials, performed on three roof slopes, using two postures. Tensor <italic><bold>T</bold></italic> was referred to as &#x02018;incomplete tensor&#x02019; in this study.</p><p id="P32">For a certain posture and slope, the sparse tensor <italic><bold>T</bold></italic> is denoted as <italic><bold>T</bold></italic><sub><italic>i</italic>, <italic>j</italic></sub> which represents the values of the risk indicators obtained from five trials performed by each subject at that roof slope and posture. In <italic><bold>T</bold></italic><sub><italic>i, j</italic></sub>, <italic>i</italic> &#x02208; {1, 2}, with 1 and 2 representing the static and dynamic postures respectively; <italic>j</italic> &#x02208; {1,2,3}, with 1, 2 and 3 representing the 0&#x000b0;, 15&#x000b0; and 30&#x000b0; slopes respectively. Then, the five-dimensional tensor <italic><bold>T</bold></italic> can be illustrated in <xref rid="F5" ref-type="fig">Fig. 5</xref>.</p><p id="P33">Similarly, the original datasets collected from the previous experiments were constructed to a five-dimensional tensor <italic><bold>X</bold></italic> &#x02208; <italic>&#x0211d;</italic><sup><italic>I</italic><sub>1</sub>&#x000d7;<italic>I</italic><sub>2</sub>&#x000d7;<italic>I</italic><sub>3</sub>&#x000d7;<italic>I</italic><sub>4</sub>&#x000d7;<italic>I</italic><sub>5</sub></sup> as the benchmark data, which was referred to as the &#x0201c;original tensor&#x0201d; in this study and was used for validation of performance of the fusion method.</p></sec><sec id="S17"><label>5.4.</label><title>Tensor decomposition</title><p id="P34">Once a tensor <italic><bold>T</bold></italic> was constructed, the CPD tensor decomposition was applied to decompose the tensor <italic><bold>T</bold></italic> into five factor matrices, representing information in each dimension. The factor matrices were represented as matrices <italic><bold>A</bold>, <bold>B</bold>, <bold>C</bold>, <bold>D</bold>,</italic> and <italic><bold>E</bold></italic>, providing latent information in dimensions of subjects, risk indicators, trials, postures, and slopes, respectively.</p><p id="P35">CPD factorizes the five-dimensional sparse tensor <italic><bold>T</bold></italic> &#x02208; <italic>&#x0211d;</italic><sup><italic>I</italic><sub>1</sub>&#x000d7;<italic>I</italic><sub>2</sub>&#x000d7;<italic>I</italic><sub>3</sub>&#x000d7;<italic>I</italic><sub>4</sub>&#x000d7;<italic>I</italic><sub>5</sub></sup> as:
<disp-formula id="FD5"><label>(5)</label><mml:math display="block" id="M6"><mml:mrow><mml:mi mathvariant="bold-italic">T</mml:mi><mml:mo>&#x02248;</mml:mo><mml:munderover><mml:mo>&#x02211;</mml:mo><mml:mrow><mml:mi>r</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mi>R</mml:mi></mml:munderover><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">a</mml:mi><mml:mi>r</mml:mi></mml:msub><mml:mo>&#x02218;</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">b</mml:mi><mml:mi>r</mml:mi></mml:msub><mml:mo>&#x02218;</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">c</mml:mi><mml:mi>r</mml:mi></mml:msub><mml:mo>&#x02218;</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">d</mml:mi><mml:mi>r</mml:mi></mml:msub><mml:mo>&#x02218;</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">e</mml:mi><mml:mi>r</mml:mi></mml:msub></mml:mrow><mml:mo>&#x02261;</mml:mo><mml:mrow><mml:mo>&#x0301a;</mml:mo><mml:mrow><mml:mi mathvariant="bold-italic">A</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="bold-italic">B</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="bold-italic">C</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="bold-italic">D</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="bold-italic">E</mml:mi></mml:mrow><mml:mo>&#x0301b;</mml:mo></mml:mrow></mml:mrow></mml:math></disp-formula>
where <italic><bold>a</bold><sub>r</sub></italic> &#x02208; <italic>&#x0211d;</italic><sup><italic>I</italic><sub>1</sub></sup>, <italic><bold>b</bold><sub>r</sub></italic> &#x02208; <italic>&#x0211d;</italic><sup><italic>I</italic><sub>2</sub></sup>, <italic><bold>c</bold><sub>r</sub></italic> &#x02208; <italic>&#x0211d;</italic><sup><italic>I</italic><sub>3</sub></sup>, <italic><bold>d</bold><sub>r</sub></italic> &#x02208; <italic>&#x0211d;</italic><sup><italic>I</italic><sub>4</sub></sup>, <italic><bold>e</bold><sub>r</sub></italic> &#x02208; <italic>&#x0211d;</italic><sup><italic>I</italic><sub>5</sub></sup>, for <italic>r</italic> = 1, &#x02026;, <italic>R</italic>. Factor matrix <bold>A</bold> = [<italic><bold>a</bold></italic><sub>1</sub>,<italic><bold>a</bold></italic><sub>2</sub>&#x02026;&#x02026;&#x02026;<italic><bold>a</bold></italic><sub><italic>R</italic></sub>] &#x02208; <italic>&#x0211d;</italic><sup><italic>I</italic><sub>1</sub>&#x000d7;<italic>R</italic></sup>, <bold>B</bold> = [<italic><bold>b</bold></italic><sub>1</sub>, <italic><bold>b</bold></italic><sub>2</sub>&#x02026;&#x02026;&#x02026;<bold><italic>b</italic></bold><sub><italic>R</italic></sub>] &#x02208; <italic>&#x0211d;</italic><sup><italic>I</italic><sub>2</sub>&#x000d7;<italic>R</italic></sup>, <bold>C</bold>= [ <italic><bold>c</bold></italic><sub>1</sub>,<italic><bold>c</bold></italic><sub>2</sub>&#x02026;&#x02026;&#x02026;<italic><bold>c</bold></italic><sub><italic>R</italic></sub>] &#x02208; <italic>&#x0211d;</italic><sup><italic>I</italic><sub>3</sub>&#x000d7;<italic>R</italic></sup>, <bold>D</bold> = [ <bold><italic>d</italic></bold><sub>1</sub>,<bold><italic>d</italic></bold><sub>2</sub>&#x02026;&#x02026;&#x02026; <bold><italic>d</italic></bold><sub><italic>R</italic></sub>] &#x02208; <italic>&#x0211d;</italic><sup><italic>I</italic><sub>4</sub>&#x000d7;<italic>R</italic></sup>, and <bold>E</bold>= [<italic><bold>e</bold></italic><sub>1</sub>, <italic><bold>e</bold></italic><sub>2</sub>&#x02026;<italic>e</italic><sub><italic>R</italic></sub>] &#x02208; &#x0211d;<sup><italic>I</italic><sub>5</sub>&#x000d7;<italic>R</italic></sup> are collections of all <italic>R</italic> vectors of <italic><bold>a</bold><sub>r</sub></italic>, <italic><bold>b</bold></italic><sub><italic>r</italic></sub>, <italic><bold>c</bold><sub><italic>r</italic></sub>, <italic><bold>d</bold></italic><sub><italic>r</italic></sub>, <italic><bold>e</bold></italic><sub><italic>r</italic></sub></italic> respectively.</p><p id="P37"><xref rid="F6" ref-type="fig">Fig. 6</xref> illustrates the process of the CPD tensor decomposition.</p><p id="P38"><xref rid="FD5" ref-type="disp-formula">Eq. (5)</xref> can be represented as:
<disp-formula id="FD6"><label>(6)</label><mml:math display="block" id="M8"><mml:mrow><mml:mi mathvariant="bold-italic">T</mml:mi><mml:mo>&#x02248;</mml:mo><mml:munder><mml:mo>&#x02211;</mml:mo><mml:mi>R</mml:mi></mml:munder><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">A</mml:mi><mml:mi>r</mml:mi></mml:msub><mml:mo>&#x02218;</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">B</mml:mi><mml:mi>r</mml:mi></mml:msub><mml:mo>&#x02218;</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">C</mml:mi><mml:mi>r</mml:mi></mml:msub><mml:mo>&#x02218;</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">D</mml:mi><mml:mi>r</mml:mi></mml:msub><mml:mo>&#x02218;</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">E</mml:mi><mml:mi>r</mml:mi></mml:msub></mml:mrow></mml:mrow></mml:math></disp-formula></p><p id="P39">Factor matrices <italic><bold>A</bold></italic>, <italic><bold>B</bold></italic>, <italic><bold>C</bold></italic>, <italic><bold>D</bold></italic> and <italic><bold>E</bold></italic> are computed by solving the following optimization equation:
<disp-formula id="FD7"><label>(7)</label><mml:math display="block" id="M9"><mml:mrow><mml:mtext>arg min</mml:mtext><mml:mrow><mml:mo>&#x02016;</mml:mo><mml:mrow><mml:mi mathvariant="bold-italic">T</mml:mi><mml:mo>&#x02212;</mml:mo><mml:munder><mml:mo>&#x02211;</mml:mo><mml:mi>R</mml:mi></mml:munder><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">A</mml:mi><mml:mi>r</mml:mi></mml:msub><mml:mo>&#x02218;</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">B</mml:mi><mml:mi>r</mml:mi></mml:msub><mml:mo>&#x02218;</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">C</mml:mi><mml:mi>r</mml:mi></mml:msub></mml:mrow><mml:mo>&#x02218;</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">D</mml:mi><mml:mi>r</mml:mi></mml:msub><mml:mo>&#x02218;</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">E</mml:mi><mml:mi>r</mml:mi></mml:msub></mml:mrow><mml:mo>&#x02016;</mml:mo></mml:mrow></mml:mrow></mml:math></disp-formula></p><p id="P40">To solve <xref rid="FD7" ref-type="disp-formula">Eq. (7)</xref>, the alternating least squares algorithm was applied using an optimization-based program <italic>cpd_als</italic> in Tensor Toolbox from MATLAB [<xref rid="R47" ref-type="bibr">47</xref>].</p><sec id="S18"><label>5.4.1.</label><title>Selection of R (number of rank-1 tensors) for CPD</title><p id="P41">Before applying CPD, a proper <italic>R</italic> needs to be selected to guarantee a unique mapping of the original tensor to the decomposed tensor. To determine the smallest number of rank-1 tensors required for CPD, the core consistency diagnostic method (CORCONDIA) described in [<xref rid="R48" ref-type="bibr">48</xref>] was applied with the N-way toolbox of MATLAB [<xref rid="R49" ref-type="bibr">49</xref>]. In this method, a Tucker core is used for assessing the appropriateness of a CPD model. Tucker core array, when presenting the coordinates of the parts of the multi-dimensional array within the subspaces, is considered as the regression of a multi-dimensional array onto the subspaces characterized by the factor matrices [<xref rid="R48" ref-type="bibr">48</xref>]. In the CONCORDIA method, to signify an optimal representation of the multi-dimensional array with respect to the subspaces defined by the factors, a parameter called &#x0201c;core consistency&#x0201d; is measured. A core consistency value close to 100% indicates an appropriate CPD model that can properly represent the variability of the factors. With the increase of the number of components (<italic>R</italic>), the core consistency value typically increases up to a certain point and then decreases. To apply CPD in this study, one component model (<italic>R</italic> = 1) was first attempted and then different values of <italic>R</italic> were tried by gradually increasing the number of components until the core consistency value reached close to 100%. Those values of <italic>R</italic> with the highest core consistencies were considered for CPD application.</p></sec></sec><sec id="S19"><label>5.5.</label><title>Tensor reconstruction (fused dataset for risk assessment)</title><p id="P42">Once the tensor was decomposed into factor matrices, these factor matrices were used to reconstruct a new tensor from its decomposed state. This reconstructed tensor is an approximate mapping of the sparse tensor <italic><bold>T</bold></italic>, which was referred to as &#x02018;fused tensor&#x02019; and represented as <italic><bold>T</bold></italic> &#x02032; &#x02208; <italic>&#x0211d;</italic><sup><italic>I</italic><sub>1</sub>&#x000d7;<italic>I</italic><sub>2</sub>&#x000d7;<italic>I</italic><sub>3</sub>&#x000d7;<italic>I</italic><sub>4</sub>&#x000d7;<italic>I</italic><sub>5</sub></sup>, calculated by:
<disp-formula id="FD8"><label>(8)</label><mml:math display="block" id="M10"><mml:mrow><mml:mi mathvariant="bold-italic">T</mml:mi><mml:mo>&#x02032;</mml:mo><mml:mo>=</mml:mo><mml:munderover><mml:mo>&#x02211;</mml:mo><mml:mrow><mml:mi>r</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mi>R</mml:mi></mml:munderover><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">A</mml:mi><mml:mi>r</mml:mi></mml:msub><mml:mo>&#x02218;</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">B</mml:mi><mml:mi>r</mml:mi></mml:msub><mml:mo>&#x02218;</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">C</mml:mi><mml:mi>r</mml:mi></mml:msub><mml:mo>&#x02218;</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">D</mml:mi><mml:mi>r</mml:mi></mml:msub><mml:mo>&#x02218;</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">E</mml:mi><mml:mi>r</mml:mi></mml:msub></mml:mrow></mml:mrow></mml:math></disp-formula></p><p id="P43">In this tensor, all the missing values were imputed, and the risk indicator values were reconstructed. Tensorlab (a MATLAB package for tensor computation) was used for the imputation and reconstruction [<xref rid="R50" ref-type="bibr">50</xref>]. Note that the factor matrices computed by <xref rid="FD7" ref-type="disp-formula">Eq. (7)</xref> ensured the minimal reconstruction error during the imputation and reconstruction of the risk indicators. The reconstructed risk indicators were then extracted to a spread sheet, readily for use in WMSD risk assessment.</p><p id="P44">To allow for reliable observations, the procedures in <xref rid="S15" ref-type="sec">Subsections 5.2</xref> to <xref rid="S19" ref-type="sec">5.5</xref> (except for construction of the five-dimensional tensor <italic><bold>X</bold></italic> from the original datasets) were repeated three times. In each repeat, the random removal of certain portion of data as designed earlier led to different preservation in each dataset. Three repeats ensured a two-thirds (67%) probability that the averaged results were more accurate than a single experiment while maintaining time-efficiency in implementation.</p></sec><sec id="S20"><label>5.6.</label><title>Performance analysis of the data fusion</title><p id="P45">The performance of the data fusion method was evaluated by comparing the reconstructed data to the original data using two statistical measures&#x02014;root mean square error (RMSE) and mean absolute error (MAE). RMSE and MAE were used to measure the standard error and average magnitude of the error of the fused tensor, respectively [<xref rid="R51" ref-type="bibr">51</xref>,<xref rid="R52" ref-type="bibr">52</xref>].</p><p id="P46">The <italic>RMSE</italic> was calculated using the following equation:
<disp-formula id="FD9"><label>(9)</label><mml:math display="block" id="M11"><mml:mrow><mml:mi>R</mml:mi><mml:mi>M</mml:mi><mml:mi>S</mml:mi><mml:mi>E</mml:mi><mml:mo>=</mml:mo><mml:msqrt><mml:mrow><mml:mfrac><mml:mn>1</mml:mn><mml:mi>N</mml:mi></mml:mfrac><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:munderover><mml:mo>&#x02211;</mml:mo><mml:mrow><mml:msub><mml:mi>i</mml:mi><mml:mn>1</mml:mn></mml:msub><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:msub><mml:mi>I</mml:mi><mml:mn>1</mml:mn></mml:msub></mml:mrow></mml:munderover><mml:mrow><mml:munderover><mml:mo>&#x02211;</mml:mo><mml:mrow><mml:msub><mml:mi>i</mml:mi><mml:mn>2</mml:mn></mml:msub><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:msub><mml:mi>I</mml:mi><mml:mn>2</mml:mn></mml:msub></mml:mrow></mml:munderover><mml:mrow><mml:munderover><mml:mo>&#x02211;</mml:mo><mml:mrow><mml:msub><mml:mi>i</mml:mi><mml:mn>3</mml:mn></mml:msub><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:msub><mml:mi>I</mml:mi><mml:mn>3</mml:mn></mml:msub></mml:mrow></mml:munderover><mml:mrow><mml:munderover><mml:mo>&#x02211;</mml:mo><mml:mrow><mml:msub><mml:mi>i</mml:mi><mml:mn>4</mml:mn></mml:msub><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:msub><mml:mi>I</mml:mi><mml:mn>4</mml:mn></mml:msub></mml:mrow></mml:munderover><mml:mrow><mml:munderover><mml:mo>&#x02211;</mml:mo><mml:mrow><mml:msub><mml:mi>i</mml:mi><mml:mn>5</mml:mn></mml:msub><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:msub><mml:mi>I</mml:mi><mml:mn>5</mml:mn></mml:msub></mml:mrow></mml:munderover><mml:mrow><mml:msup><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:msub><mml:mi>i</mml:mi><mml:mn>1</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>i</mml:mi><mml:mn>2</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>i</mml:mi><mml:mn>3</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>i</mml:mi><mml:mn>4</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>i</mml:mi><mml:mn>5</mml:mn></mml:msub></mml:mrow></mml:msub><mml:mo>&#x02212;</mml:mo><mml:mi>T</mml:mi><mml:msub><mml:mo>&#x02032;</mml:mo><mml:mrow><mml:msub><mml:mi>i</mml:mi><mml:mn>1</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>i</mml:mi><mml:mn>2</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>i</mml:mi><mml:mn>3</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>i</mml:mi><mml:mn>4</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>i</mml:mi><mml:mn>5</mml:mn></mml:msub></mml:mrow></mml:msub></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mrow><mml:mn>2</mml:mn></mml:msup></mml:mrow></mml:mrow></mml:mrow></mml:mrow></mml:mrow></mml:mrow><mml:mo>]</mml:mo></mml:mrow></mml:mrow></mml:msqrt></mml:mrow></mml:math></disp-formula></p><p id="P47">Here, <italic>X</italic><sub><italic>i</italic><sub>1</sub>, <italic>i</italic><sub>2</sub>, <italic>i</italic><sub>3</sub>, <italic>i</italic><sub>4</sub>, <italic>i</italic><sub>5</sub></sub> is an element (a value of a risk indicator of a certain subject at a certain slope, posture and trial) in the original tensor <italic>X</italic> and <italic>T</italic>&#x02032;<sub><italic>i</italic><sub>1</sub>, <italic>i</italic><sub>2</sub>, <italic>i</italic><sub>3</sub>, <italic>i</italic><sub>4</sub>, <italic>i</italic><sub>5</sub></sub> is an element in the re-constructed tensor (fused dataset). <italic>N</italic> is the total number of cells of the tensor which was computed as <italic>N</italic> = <italic>I</italic><sub>1</sub> &#x000d7; <italic>I</italic><sub>2</sub> &#x000d7; <italic>I</italic><sub>3</sub> &#x000d7; <italic>I</italic><sub>4</sub> &#x000d7; <italic>I</italic><sub>5</sub>.</p><p id="P48">The <italic>MAE</italic> was calculated using the following equation:
<disp-formula id="FD10"><label>(10)</label><mml:math display="block" id="M12"><mml:mrow><mml:mi>M</mml:mi><mml:mi>A</mml:mi><mml:mi>E</mml:mi><mml:mo>=</mml:mo><mml:mfrac><mml:mn>1</mml:mn><mml:mi>N</mml:mi></mml:mfrac><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:munderover><mml:mo>&#x02211;</mml:mo><mml:mrow><mml:msub><mml:mi>i</mml:mi><mml:mn>1</mml:mn></mml:msub><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:msub><mml:mi>I</mml:mi><mml:mn>1</mml:mn></mml:msub></mml:mrow></mml:munderover><mml:mrow><mml:munderover><mml:mo>&#x02211;</mml:mo><mml:mrow><mml:msub><mml:mi>i</mml:mi><mml:mn>2</mml:mn></mml:msub><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:msub><mml:mi>I</mml:mi><mml:mn>2</mml:mn></mml:msub></mml:mrow></mml:munderover><mml:mrow><mml:munderover><mml:mo>&#x02211;</mml:mo><mml:mrow><mml:msub><mml:mi>i</mml:mi><mml:mn>3</mml:mn></mml:msub><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:msub><mml:mi>I</mml:mi><mml:mn>3</mml:mn></mml:msub></mml:mrow></mml:munderover><mml:mrow><mml:munderover><mml:mo>&#x02211;</mml:mo><mml:mrow><mml:msub><mml:mi>i</mml:mi><mml:mn>4</mml:mn></mml:msub><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:msub><mml:mi>I</mml:mi><mml:mn>4</mml:mn></mml:msub></mml:mrow></mml:munderover><mml:mrow><mml:munderover><mml:mo>&#x02211;</mml:mo><mml:mrow><mml:msub><mml:mi>i</mml:mi><mml:mn>5</mml:mn></mml:msub><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:msub><mml:mi>I</mml:mi><mml:mn>5</mml:mn></mml:msub></mml:mrow></mml:munderover><mml:mrow><mml:mrow><mml:mo>|</mml:mo><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:msub><mml:mi>i</mml:mi><mml:mn>1</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>i</mml:mi><mml:mn>2</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>i</mml:mi><mml:mn>3</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>i</mml:mi><mml:mn>4</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>i</mml:mi><mml:mn>5</mml:mn></mml:msub></mml:mrow></mml:msub><mml:mo>&#x02212;</mml:mo><mml:mi>T</mml:mi><mml:msub><mml:mo>&#x02032;</mml:mo><mml:mrow><mml:msub><mml:mi>i</mml:mi><mml:mn>1</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>i</mml:mi><mml:mn>2</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>i</mml:mi><mml:mn>3</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>i</mml:mi><mml:mn>4</mml:mn></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>i</mml:mi><mml:mn>5</mml:mn></mml:msub></mml:mrow></mml:msub></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mrow><mml:mo>|</mml:mo></mml:mrow></mml:mrow></mml:mrow></mml:mrow></mml:mrow></mml:mrow></mml:mrow><mml:mo>]</mml:mo></mml:mrow></mml:mrow></mml:math></disp-formula></p><p id="P49">The performance of the data fusion method was also evaluated by comparing the risk assessment results obtained from tensor <italic>T</italic>&#x02032; to those obtained from tensor <italic>X</italic> collected in previous experimental studies. In this comparison, consistency of the effects of roof slope and working posture on five knee rotation angles and five knee postural muscles&#x02019; EMG between the original and fused datasets were considered as a performance metric. In previous studies, the authors explored if the work-related factors (i.e., roof slope and working posture) are associated with knee WMSDs among construction roofers [<xref rid="R42" ref-type="bibr">42</xref>,<xref rid="R43" ref-type="bibr">43</xref>]. Those studies considered the original tensor <italic>(X)</italic> for risk assessments, whereas in this study, the fused tensor (<italic>T</italic>&#x02032;) was considered for assessing the similar risk to see if the risk assessment results were consistent. This way, the performance of the fusion method could be evaluated.</p><p id="P50">The consistency (Con) was computed using the following equation:
<disp-formula id="FD11"><label>(11)</label><mml:math display="block" id="M13"><mml:mrow><mml:mtext>Con</mml:mtext><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:msubsup><mml:mo>&#x02211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mi>P</mml:mi></mml:msubsup><mml:mrow><mml:mi>f</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:msub><mml:mi>m</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:mi>m</mml:mi><mml:msub><mml:mo>&#x02032;</mml:mo><mml:mi>i</mml:mi></mml:msub></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:mrow><mml:mi>P</mml:mi></mml:mfrac><mml:mo>&#x000d7;</mml:mo><mml:mn>100.</mml:mn></mml:mrow></mml:math></disp-formula>
where, <italic>P</italic> = Total number of possible effects of roof slope, working posture and their interactions on five knee rotation angles and EMG of five knee postural muscles [total 30 = 3 factors (slope, posture, slope-posture interaction) &#x000d7; 10 response variables (5 knee rotations and 5 EMG measurements)].</p><p id="P52"><italic>m</italic><sub><italic>t</italic></sub> denotes the effects of the factors (slope and posture) on the response variables obtained from the original tensor (<italic>X</italic>).</p><p id="P53"><italic>m&#x02032;<sub>i</sub></italic> denotes the effects of the factors (slope and posture) on the response variables obtained from the fused tensor (<italic>T</italic>&#x02032;).</p><disp-formula id="FD12"><mml:math display="block" id="M14"><mml:mrow><mml:mi>f</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:msub><mml:mi>m</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:msubsup><mml:mi>m</mml:mi><mml:mi>i</mml:mi><mml:mo>&#x02032;</mml:mo></mml:msubsup></mml:mrow><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mtable><mml:mtr><mml:mtd><mml:mrow><mml:mn>1</mml:mn><mml:mo>,</mml:mo></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mi>i</mml:mi><mml:mi>f</mml:mi></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:msub><mml:mi>m</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:msubsup><mml:mi>m</mml:mi><mml:mi>i</mml:mi><mml:mo>&#x02032;</mml:mo></mml:msubsup></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mrow><mml:mn>0</mml:mn><mml:mo>,</mml:mo></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mi>i</mml:mi><mml:mi>f</mml:mi></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:msub><mml:mi>m</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>&#x02260;</mml:mo><mml:msubsup><mml:mi>m</mml:mi><mml:mi>i</mml:mi><mml:mo>&#x02032;</mml:mo></mml:msubsup></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow><mml:mo>)</mml:mo></mml:mrow><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula><p id="P54">All the resulting measurements of the performance metrics were averaged over the three trial repetitions.</p></sec><sec id="S21"><label>5.7.</label><title>Results</title><p id="P55"><xref rid="T5" ref-type="table">Table 5</xref> summarizes the results, including the RMSE, MAE, consistency (Con), number of rank-1 tensors <italic>R</italic> (latent factor) and core consistency values computed for the knee rotation and EMG datasets with different proportions of missing values.</p><sec id="S22"><label>5.7.1.</label><title>RMSE curves for datasets with different proportions of missing values</title><p id="P56"><xref rid="F7" ref-type="fig">Fig. 7</xref> displays the RMSE curves for different proportions of missing data from the knee rotation and EMG datasets. In <xref rid="F7" ref-type="fig">Fig. 7</xref>, rot-DS and EMG-DS means the rotation dataset and EMD dataset respectively.</p><p id="P57">Generally, the RMSE curve demonstrated an increasing trend with the increase of missing values in both rot-DS and EMG-DS. It was graphically observed that, for up to 40% missing data from rot-DS and up to 60% missing data from EMG-DS, the RMSE values displayed a small variation from 1 to 3. When the proportion of the missing data from rot-DS increased to 50%, the RMSE value increased drastically for different proportions of missing values in EMG-DS. The RMSE curves for 50%, 60% and 70% missing data from the rot-DS shifted slightly upward, indicating that with the increase of missing values in rot-DS, the RMSE values generally increased. From 50% to up to 70% missing values from rot-DS and up to 70% missing values from EMG-DS, the RMSE values varied considerably from 3.5 to 9. These results suggested that with the increase in missing values in both datasets, the performance of the fusion method decreased. The variation in RMSE values was small for up to 40% missing values in rot-DS and up to 60% missing values in EMG-DS. However, with further increase of missing values in both datasets, RMSE values varied considerably.</p></sec><sec id="S23"><label>5.7.2.</label><title>MAE curves for datasets with different proportions of missing values</title><p id="P58"><xref rid="F8" ref-type="fig">Fig. 8</xref> shows the MAE curves for different proportions of missing data from the knee rotation and EMG datasets. In <xref rid="F8" ref-type="fig">Fig. 8</xref>, rot-DS and EMG-DS means the rotation dataset and EMD dataset respectively.</p><p id="P59">The MAE curves demonstrated similar patterns to the RMSE curves. In general, the MAE curves demonstrated an increasing trend with the increase of missing values in both datasets. It was observed that, for up to 40% missing values from rot-DS and up to 60% missing values from EMG-DS, the variation of the MAE values was between 0.6 and 1.5. When missing values in EMG-DS were further increased (up to 70%), a sharp increase in MAE curve was observed when 30% data were missing in rot-DS. When the proportion of missing values in rot-DS was further increased to 50%, MAE values also tended to increase. Similar to the RMSE curves, for 50%, 60% and 70% missing data in rot-DS, the MAE curves shifted slightly upward, indicating that with the increase of missing values in rot-DS, MAE values also increased. In general, from 50%&#x02212;70% missing values in rotation dataset and up to 70% missing values in EMG dataset, the values of MAE varied considerably from 1.5&#x02013;4. These results indicated that with the increase in missing values in both datasets, the performance of the fusion method tended to decease as MAE values increased. The variation in MAE values was small for up to 40% missing values in rot-DS and up to 60% missing values in EMG-DS. However, with further increase of missing values in both datasets, the MAE values varied considerably.</p></sec><sec id="S24"><label>5.7.3.</label><title>Consistency curves for datasets with different proportions of missing values</title><p id="P61"><xref rid="F9" ref-type="fig">Fig. 9</xref> shows the consistency curves for different proportions of missing data from the knee rotation and EMG datasets. In <xref rid="F9" ref-type="fig">Fig. 9</xref>, rot-DS and EMG-DS means the rotation dataset and EMD dataset respectively.</p><p id="P62">The consistency curves generally demonstrated a decreasing trend with the increase of missing values in both datasets. It was observed that for up to 70% missing values from rot-DS and up to 60% missing values from EMG-DS, the consistency values varied to a small extent from 87% to 70%. However, for 70% missing values in EMG-DS, the consistency value decreased considerably for several configurations, particularly for 30%, 50% and 70% missing values in rot-DS. These results indicated that with the increase in missing values in both datasets, the performance of the fusion method decreased in general.</p></sec></sec></sec><sec id="S25"><label>6.</label><title>Discussion and study limitations</title><p id="P63">To ensure reliable risk assessment of WMSD, proper data collection is crucial. However, human-based data can potentially have missing data points due to dropout from the data collection technology. Moreover, risks sometimes cannot be fully quantified with a single risk indicator and thus multiple heterogeneous risk indicators are often collected for risk assessment. As a result, a viable method is needed that reserves the interrelation among multiple risk indicators and potential risk factors during rectification of these multiple imperfect and incomplete datasets.</p><p id="P64">This research proposed the use of a CPD tensor decomposition-based data fusion method, which fuses multiple imperfect datasets in order to replace missing data. The proposed method generates a fused dataset by learning the latent structures and collaborative relationships among the potential risk factors and the risk indicators captured in multiple work settings. By learning their correlations, this proposed method replaces the missing values in the multiple incomplete datasets with minimum artifacts. The sample analysis results demonstrated that for up to a significant portion of missing values from both datasets, the proposed method can generate a fused dataset, by which reliable risk assessment results can be obtained consistent to those obtained from the original datasets to a great extent (87% to 70%). The possible reason for some inconsistencies in the risk assessment results may be attributed to the presence of significant amount of missing values in the individual datasets. In spite of this, for the specific datasets used in this study, the proposed method performed consistently up to 40% of missing data in the original datasets and could be reasonably applied to those datasets. The RMSE and MAE values demonstrated that the reconstruction errors in fusion were very small up to this range of missing data. This suggests that the reconstructed fused values of the risk indicators in the new dataset are close to those in the original, unaltered experimental datasets and hence, a good fusion performance. Although there is no standard threshold for good or bad RMSE and MAE values, low values of RMSE and MAE (within 10% of the range of the reference values) are preferable. RMSE should be less than 10% of the range of target property value [<xref rid="R53" ref-type="bibr">53</xref>]. In the current study, for different portions of missing values, the obtained RMSE values ranged from 0.74% to 5% relative to the difference between the minimum and maximum values of the risk indicators in the original dataset. Additionally, for up to 40% missing values from the knee rotation dataset and 60% missing values from the EMG dataset, the RMSE values were within 15% of the average value of the risk indicators in the original dataset. Similarly, the MAE values were within 10% of the average value of the risk indicators in the original dataset. However, for the specific datasets used in this study, the RMSE and MAE values varied considerably as the proportion of missing data increased to 50% and 70% in the rotation and EMG datasets respectively. The possible reason might be the presence of significant amount of missing values in the individual datasets that drastically deteriorated the performance of the fusion method in reconstruction of the data and therefore negatively affected the RMSE and MAE values.</p><p id="P65">Apart from the previous three performance metrics, the performance of the present method can be further validated by the &#x02018;core consistency&#x02019; values obtained when measuring number of rank-1 tensors <italic>R</italic> (latent factor) for CPD. For different <italic>R</italic> values presented in <xref rid="T5" ref-type="table">Table 5</xref>, the &#x02018;core consistency&#x02019; values ranged from 88%-98%, indicating that the CPD tensor decomposition is able to represent the variability of the multi-dimensional EMG and rotation data in the subspace substantially, thus providing a good fusion result. Overall, the findings suggest that the proposed method can handle up to 40% missing data and can reasonably be applied in WMSD risk assessment studies. The performance of this method was evaluated based on: a) how close a reconstructed fused dataset was to the original dataset, and b) how consistent the risk assessment results from a fused dataset were to those from the original dataset.</p><p id="P66">Although the proposed method has been proven successful in generating a fused dataset for used in risk assessment studies, there are still some limitations that are worth further investigation. First, in this study, only the maximum knee rotation angles and maximum normalized muscle activity measurements were considered for fusion. It would be interesting to explore a detailed time series dataset instead of an aggregated metric such as maximum. Although data fusion is known to provide deeper and more accurate insights than those obtained from individual single data source [<xref rid="R54" ref-type="bibr">54</xref>], another limitation of this work is the performance of the data fusion was only assessed by comparing the risk assessment results obtained from the fused datasets to the original dataset. The fused datasets were proven to be statistically similar to the original dataset, indicating that the fusion method was successful in reconstruction of a dataset when a significant amount of data are missing. Yet, how a fused dataset outperforms the individual incomplete datasets by which the fused dataset is constructed in terms of performance of risk assessment has not been measured.</p></sec><sec id="S26"><label>7.</label><title>Conclusion and future extension</title><p id="P67">The current research proposed a method that applies the CPD tensor decomposition technique to fuse multiple imperfect and incomplete datasets, as well as replacing missing data for assessing WMSD risks among construction workers. The proposed method helps not only in replacing missing values, but also holds the correlation among the potential risk factors and the risk indicators during replacement. The method was validated by comparing the risk assessment results obtained from the fused datasets to those from the original experimental dataset. RMSE and MAE values were also computed to compare the performance of the fusion method with the increase of missing values. The validation results suggest that the reconstructed fused datasets were statistically similar to the original experimental datasets and therefore able to provide consistent risk assessment results up to a significant amount of missing values. This method can be used as an alternative risk assessment method when data collected from experimental studies or real construction sites are incomplete due to missing data points. The findings will help enable accurate assessment of work-related risk factors for WMSDs among construction workers. Although the proposed method only used two sparse datasets for fusion, it can be extended to fuse more datasets. In future, this method will be explored in fusing imperfect and incomplete multiple time series datasets. The performance of the fusion will also be further tested by comparing consistencies of the risk assessment results to those obtained from datasets with missing values.</p></sec></body><back><ack id="S27"><title>Acknowledgments</title><p id="P68">The authors acknowledge the support of the National Institute for Occupational Safety and Health (NIOSH), who funded this research. The findings and conclusions in this research are those of the authors and do not necessarily represent the opinion of the National Institute for Occupational Safety and Health, Centers for Disease Control and Prevention.</p></ack><fn-group><fn fn-type="COI-statement" id="FN1"><p id="P69">Declaration of competing interest</p><p id="P70">The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.</p></fn></fn-group><ref-list><title>References</title><ref id="R1"><label>[1]</label><mixed-citation publication-type="web"><collab>BLS</collab>, <source>Nonfatal occupational injuries and illnesses: cases with days away from work</source>, <comment>&#x0003c;<ext-link ext-link-type="uri" xlink:href="https://www.bls.gov/news.release/pdf/osh2.pdf">https://www.bls.gov/news.release/pdf/osh2.pdf</ext-link>&#x0003e; (4 March, 2019)</comment>, <year>2019</year>.</mixed-citation></ref><ref id="R2"><label>[2]</label><mixed-citation publication-type="journal"><name><surname>David</surname><given-names>G</given-names></name>, <article-title>Ergonomic methods for assessing exposure to risk factors for work-related musculoskeletal disorders</article-title>, <source>Occup. Med</source>
<volume>55</volume> (<issue>3</issue>) (<year>2005</year>) <fpage>190</fpage>&#x02013;<lpage>199</lpage>, <pub-id pub-id-type="doi">10.1093/occmed/kqi082</pub-id><comment>.</comment></mixed-citation></ref><ref id="R3"><label>[3]</label><mixed-citation publication-type="book"><name><surname>Data</surname><given-names>MC</given-names></name>, <source>Secondary Analysis of Electronic Health Records</source>, <publisher-name>Springer International Publishing</publisher-name>, <year>2016</year>.</mixed-citation></ref><ref id="R4"><label>[4]</label><mixed-citation publication-type="journal"><name><surname>Young</surname><given-names>W</given-names></name>, <name><surname>Weckman</surname><given-names>G</given-names></name>, <name><surname>Holland</surname><given-names>W</given-names></name>, <article-title>A survey of methodologies for the treatment of missing values within datasets: limitations and benefits</article-title>, <source>Theor. Issues Ergon. Sci</source>
<volume>12</volume> (<issue>1</issue>) (<year>2011</year>) <fpage>15</fpage>&#x02013;<lpage>43</lpage>, <pub-id pub-id-type="doi">10.1080/14639220903470205</pub-id><comment>.</comment></mixed-citation></ref><ref id="R5"><label>[5]</label><mixed-citation publication-type="journal"><name><surname>Rabanser</surname><given-names>S</given-names></name>, <name><surname>Shchur</surname><given-names>O</given-names></name>, <name><surname>G&#x000fc;nnemann</surname><given-names>S</given-names></name>, <article-title>Introduction to tensor decompositions and their applications in machine learning</article-title>, <source>arXiv Preprint</source>, <year>2017</year>
<comment><ext-link ext-link-type="uri" xlink:href="https://arxiv.org/abs/1711.10781v1">https://arxiv.org/abs/1711.10781v1</ext-link>.</comment></mixed-citation></ref><ref id="R6"><label>[6]</label><mixed-citation publication-type="journal"><name><surname>Lin</surname><given-names>JY</given-names></name>, <name><surname>Lu</surname><given-names>Y</given-names></name>, <name><surname>Tu</surname><given-names>X</given-names></name>, <article-title>How to avoid missing data and the problems they pose: design considerations</article-title>, <source>Shanghai Arch. Psychiatry</source>
<volume>24</volume> (<issue>3</issue>) (<year>2012</year>) <fpage>181</fpage>&#x02013;<lpage>184</lpage>, <pub-id pub-id-type="doi">10.3969/j.issn.1002-0829.2012.03.010</pub-id><comment>.</comment><pub-id pub-id-type="pmid">25324625</pub-id></mixed-citation></ref><ref id="R7"><label>[7]</label><mixed-citation publication-type="journal"><name><surname>Kang</surname><given-names>H</given-names></name>, <article-title>The prevention and handling of the missing data</article-title>, <source>Korean J. Anesthesiol</source>
<volume>64</volume> (<issue>5</issue>) (<year>2013</year>) <fpage>402</fpage>&#x02013;<lpage>406</lpage>, <pub-id pub-id-type="doi">10.4097/kjae.2013.64.5.402</pub-id><comment>.</comment><pub-id pub-id-type="pmid">23741561</pub-id></mixed-citation></ref><ref id="R8"><label>[8]</label><mixed-citation publication-type="journal"><name><surname>Button</surname><given-names>KS</given-names></name>, <name><surname>Ioannidis</surname><given-names>JP</given-names></name>, <name><surname>Mokrysz</surname><given-names>C</given-names></name>, <name><surname>Nosek</surname><given-names>BA</given-names></name>, <name><surname>Flint</surname><given-names>J</given-names></name>, <name><surname>Robinson</surname><given-names>ES</given-names></name>, <name><surname>Munaf&#x000f2;</surname><given-names>MR</given-names></name>, <article-title>Power failure: why small sample size undermines the reliability of neuroscience</article-title>, <source>Nat. Rev. Neurosci</source>
<volume>14</volume> (<issue>5</issue>) (<year>2013</year>) <fpage>365</fpage>&#x02013;<lpage>376</lpage>, <pub-id pub-id-type="doi">10.1038/nrn3475</pub-id><comment>.</comment><pub-id pub-id-type="pmid">23571845</pub-id></mixed-citation></ref><ref id="R9"><label>[9]</label><mixed-citation publication-type="journal"><name><surname>Malhotra</surname><given-names>NK</given-names></name>, <article-title>Analyzing marketing research data with incomplete information on the dependent variable</article-title>, <source>J. Mark. Res</source>
<volume>24</volume> (<issue>1</issue>) (<year>1987</year>) <fpage>74</fpage>&#x02013;<lpage>84</lpage>, <pub-id pub-id-type="doi">10.2307/3151755</pub-id><comment>.</comment></mixed-citation></ref><ref id="R10"><label>[10]</label><mixed-citation publication-type="journal"><name><surname>Hamer</surname><given-names>RM</given-names></name>, <name><surname>Simpson</surname><given-names>PM</given-names></name>, <article-title>Last observation carried forward versus mixed models in the analysis of psychiatric clinical trials</article-title>, <source>Am. J. Psychiatr</source>
<volume>166</volume> (<issue>6</issue>) (<year>2009</year>) <fpage>639</fpage>&#x02013;<lpage>641</lpage>, <pub-id pub-id-type="doi">10.1176/appi.ajp.2009.09040458</pub-id><comment>.</comment><pub-id pub-id-type="pmid">19487398</pub-id></mixed-citation></ref><ref id="R11"><label>[11]</label><mixed-citation publication-type="journal"><name><surname>Gelman</surname><given-names>A</given-names></name>, <name><surname>Raghunathan</surname><given-names>TE</given-names></name>, <article-title>Using conditional distributions for missing-data imputation</article-title>, <source>Stat. Sci</source>
<volume>15</volume> (<year>2001</year>) <fpage>268</fpage>&#x02013;<lpage>269</lpage>
<comment><ext-link ext-link-type="uri" xlink:href="http://www.stat.columbia.edu/~gelman/research/published/arnold2.pdf">http://www.stat.columbia.edu/~gelman/research/published/arnold2.pdf</ext-link>.</comment></mixed-citation></ref><ref id="R12"><label>[12]</label><mixed-citation publication-type="journal"><name><surname>Lahat</surname><given-names>D</given-names></name>, <name><surname>Adali</surname><given-names>T</given-names></name>, <name><surname>Jutten</surname><given-names>C</given-names></name>, <article-title>Multimodal data fusion: an overview of methods, challenges, and prospects</article-title>, <source>Proc. IEEE</source>
<volume>103</volume> (<issue>9</issue>) (<year>2015</year>) <fpage>1449</fpage>&#x02013;<lpage>1477</lpage>, <pub-id pub-id-type="doi">10.1109/JPROC.2015.2460697</pub-id><comment>.</comment></mixed-citation></ref><ref id="R13"><label>[13]</label><mixed-citation publication-type="journal"><name><surname>Amiri</surname><given-names>M</given-names></name>, <name><surname>Jensen</surname><given-names>R</given-names></name>, <article-title>Missing data imputation using fuzzy-rough methods</article-title>, <source>Neurocomputing</source>
<volume>205</volume> (<year>2016</year>) <fpage>152</fpage>&#x02013;<lpage>164</lpage>, <pub-id pub-id-type="doi">10.1016/j.neucom.2016.04.015</pub-id><comment>.</comment></mixed-citation></ref><ref id="R14"><label>[14]</label><mixed-citation publication-type="journal"><name><surname>Luengo</surname><given-names>J</given-names></name>, <name><surname>S&#x000e1;ez</surname><given-names>JA</given-names></name>, <name><surname>Herrera</surname><given-names>F</given-names></name>, <article-title>Missing data imputation for fuzzy rule-based classification systems</article-title>, <source>Soft. Comput</source>
<volume>16</volume> (<issue>5</issue>) (<year>2012</year>) <fpage>863</fpage>&#x02013;<lpage>881</lpage>, <pub-id pub-id-type="doi">10.1007/s00500-011-0774-4</pub-id><comment>.</comment></mixed-citation></ref><ref id="R15"><label>[15]</label><mixed-citation publication-type="journal"><name><surname>Winkler</surname><given-names>R</given-names></name>, <name><surname>Klawonn</surname><given-names>F</given-names></name>, <name><surname>Kruse</surname><given-names>R</given-names></name>, <article-title>Fuzzy c-means in high dimensional spaces</article-title>, <source>Int. J. Fuzzy Syst. Appl</source>
<volume>1</volume> (<issue>1</issue>) (<year>2011</year>) <fpage>1</fpage>&#x02013;<lpage>16</lpage>, <pub-id pub-id-type="doi">10.4018/IJFSA.2011010101</pub-id><comment>.</comment></mixed-citation></ref><ref id="R16"><label>[16]</label><mixed-citation publication-type="journal"><name><surname>Castanedo</surname><given-names>F</given-names></name>, <article-title>A review of data fusion techniques</article-title>, <source>Sci. World J</source>
<volume>2013</volume> (<year>2013</year>), <pub-id pub-id-type="doi">10.1155/2013/704504</pub-id><comment>.</comment></mixed-citation></ref><ref id="R17"><label>[17]</label><mixed-citation publication-type="journal"><name><surname>Razavi</surname><given-names>SN</given-names></name>, <name><surname>Haas</surname><given-names>CT</given-names></name>, <article-title>Multisensor data fusion for on-site materials tracking in construction</article-title>, <source>Autom. Constr</source>
<volume>19</volume> (<issue>8</issue>) (<year>2010</year>) <fpage>1037</fpage>&#x02013;<lpage>1046</lpage>, <pub-id pub-id-type="doi">10.1016/j.autcon.2010.07.017</pub-id><comment>.</comment></mixed-citation></ref><ref id="R18"><label>[18]</label><mixed-citation publication-type="journal"><name><surname>Razavi</surname><given-names>SN</given-names></name>, <name><surname>Haas</surname><given-names>CT</given-names></name>, <article-title>Reliability-based hybrid data fusion method for adaptive location estimation in construction</article-title>, <source>J. Comput. Civ. Eng</source>
<volume>26</volume> (<issue>1</issue>) (<year>2011</year>) <fpage>1</fpage>&#x02013;<lpage>10</lpage>, <pub-id pub-id-type="doi">10.1061/(ASCE)CP.1943-5487.0000101</pub-id><comment>.</comment></mixed-citation></ref><ref id="R19"><label>[19]</label><mixed-citation publication-type="book"><name><surname>Shahi</surname><given-names>A</given-names></name>, <name><surname>Cardona</surname><given-names>JM</given-names></name>, <name><surname>Haas</surname><given-names>CT</given-names></name>, <name><surname>West</surname><given-names>JS</given-names></name>, <name><surname>Caldwell</surname><given-names>GL</given-names></name>, <source>Activity-based data fusion for automated progress tracking of construction projects</source>, <publisher-name>Construction Research Congress 2012: Construction Challenges in a Flat World</publisher-name>, <year>2012</year>, pp. <fpage>838</fpage>&#x02013;<lpage>847</lpage>, , <pub-id pub-id-type="doi">10.1061/9780784412329.085</pub-id><comment>.</comment></mixed-citation></ref><ref id="R20"><label>[20]</label><mixed-citation publication-type="journal"><name><surname>Soman</surname><given-names>R</given-names></name>, <name><surname>Kyriakides</surname><given-names>M</given-names></name>, <name><surname>Onoufriou</surname><given-names>T</given-names></name>, <name><surname>Ostachowicz</surname><given-names>W</given-names></name>, <article-title>Numerical evaluation of multi-metric data fusion based structural health monitoring of long span bridge structures</article-title>, <source>Struct. Infrastruct. Eng</source>
<volume>14</volume> (<issue>6</issue>) (<year>2018</year>) <fpage>673</fpage>&#x02013;<lpage>684</lpage>, <pub-id pub-id-type="doi">10.1080/15732479.2017.1350984</pub-id><comment>.</comment></mixed-citation></ref><ref id="R21"><label>[21]</label><mixed-citation publication-type="journal"><name><surname>Anaissi</surname><given-names>A</given-names></name>, <name><surname>Makki Alamdari</surname><given-names>M</given-names></name>, <name><surname>Rakotoarivelo</surname><given-names>T</given-names></name>, <name><surname>Khoa</surname><given-names>N</given-names></name>, <article-title>A tensor-based structural damage identification and severity assessment</article-title>, <source>Sensors</source>
<volume>18</volume> (<issue>1</issue>) (<year>2018</year>) <fpage>111</fpage>, <pub-id pub-id-type="doi">10.3390/s18010111</pub-id><comment>.</comment></mixed-citation></ref><ref id="R22"><label>[22]</label><mixed-citation publication-type="journal"><name><surname>Chen</surname><given-names>J</given-names></name>, <name><surname>Qiu</surname><given-names>J</given-names></name>, <name><surname>Ahn</surname><given-names>C</given-names></name>, <article-title>Construction worker&#x02019;s awkward posture recognition through supervised motion tensor decomposition</article-title>, <source>Autom. Constr</source>
<volume>77</volume> (<year>2017</year>) <fpage>67</fpage>&#x02013;<lpage>81</lpage>, <pub-id pub-id-type="doi">10.1016/j.autcon.2017.01.020</pub-id><comment>.</comment></mixed-citation></ref><ref id="R23"><label>[23]</label><mixed-citation publication-type="journal"><name><surname>Cheng</surname><given-names>T</given-names></name>, <name><surname>Migliaccio</surname><given-names>GC</given-names></name>, <name><surname>Teizer</surname><given-names>J</given-names></name>, <name><surname>Gatti</surname><given-names>UC</given-names></name>, <article-title>Data fusion of real-time location sensing and physiological status monitoring for ergonomics analysis of construction workers</article-title>, <source>J. Comput. Civ. Eng</source>
<volume>27</volume> (<issue>3</issue>) (<year>2012</year>) <fpage>320</fpage>&#x02013;<lpage>335</lpage>, <pub-id pub-id-type="doi">10.1061/(ASCE)CP.1943-5487.0000222</pub-id><comment>.</comment></mixed-citation></ref><ref id="R24"><label>[24]</label><mixed-citation publication-type="journal"><name><surname>Cheng</surname><given-names>T</given-names></name>, <name><surname>Teizer</surname><given-names>J</given-names></name>, <name><surname>Migliaccio</surname><given-names>GC</given-names></name>, <name><surname>Gatti</surname><given-names>UC</given-names></name>, <article-title>Automated task-level activity analysis through fusion of real time location sensors and worker&#x02019;s thoracic posture data</article-title>, <source>Autom. Constr</source>
<volume>29</volume> (<year>2013</year>) <fpage>24</fpage>&#x02013;<lpage>39</lpage>, <pub-id pub-id-type="doi">10.1016/j.autcon.2012.08.003</pub-id><comment>.</comment></mixed-citation></ref><ref id="R25"><label>[25]</label><mixed-citation publication-type="journal"><name><surname>Chen</surname><given-names>H</given-names></name>, <name><surname>Luo</surname><given-names>X</given-names></name>, <name><surname>Zheng</surname><given-names>Z</given-names></name>, <name><surname>Ke</surname><given-names>J</given-names></name>, <article-title>A proactive workers&#x02019; safety risk evaluation framework based on position and posture data fusion</article-title>, <source>Autom. Constr</source>
<volume>98</volume> (<year>2019</year>) <fpage>275</fpage>&#x02013;<lpage>288</lpage>, <pub-id pub-id-type="doi">10.1016/j.autcon.2018.11.026</pub-id><comment>.</comment></mixed-citation></ref><ref id="R26"><label>[26]</label><mixed-citation publication-type="journal"><name><surname>Zheng</surname><given-names>Y</given-names></name>, <article-title>Methodologies for cross-domain data fusion: an overview</article-title>, <source>IEEE Trans. Big Data</source>
<volume>1</volume> (<issue>1</issue>) (<year>2015</year>) <fpage>16</fpage>&#x02013;<lpage>34</lpage>, <pub-id pub-id-type="doi">10.1109/TBDATA.2015.2465959</pub-id><comment>.</comment></mixed-citation></ref><ref id="R27"><label>[27]</label><mixed-citation publication-type="journal"><name><surname>Zheng</surname><given-names>Y</given-names></name>, <name><surname>Liu</surname><given-names>Y</given-names></name>, <name><surname>Yuan</surname><given-names>J</given-names></name>, <name><surname>Xie</surname><given-names>X</given-names></name>, <article-title>Urban computing with taxicabs</article-title>, <source>Proceedings of the 13th International Conference on Ubiquitous Computing</source>, <year>2011</year>, pp. <fpage>89</fpage>&#x02013;<lpage>98</lpage>, , <pub-id pub-id-type="doi">10.1145/2030112.2030126</pub-id><comment>.</comment></mixed-citation></ref><ref id="R28"><label>[28]</label><mixed-citation publication-type="journal"><name><surname>Wen</surname><given-names>W</given-names></name>, <name><surname>Wu</surname><given-names>C</given-names></name>, <name><surname>Wang</surname><given-names>Y</given-names></name>, <name><surname>Chen</surname><given-names>Y</given-names></name>, <name><surname>Li</surname><given-names>H</given-names></name>, <article-title>Learning structured sparsity in deep neural networks</article-title>, <source>Advances in Neural Information Processing Systems</source>, <year>2016</year>, pp. <fpage>2074</fpage>&#x02013;<lpage>2082</lpage>
<comment><ext-link ext-link-type="uri" xlink:href="https://papers.nips.cc/paper/6504-learning-structured-sparsity-indeep-neural-networks.pdf">https://papers.nips.cc/paper/6504-learning-structured-sparsity-indeep-neural-networks.pdf</ext-link>.</comment></mixed-citation></ref><ref id="R29"><label>[29]</label><mixed-citation publication-type="journal"><name><surname>Sorber</surname><given-names>L</given-names></name>, <name><surname>Van Barel</surname><given-names>M</given-names></name>, <name><surname>De Lathauwer</surname><given-names>L</given-names></name>, <article-title>Structured data fusion</article-title>, <source>IEEE J. Sel. Top. Signal Process</source>
<volume>9</volume> (<issue>4</issue>) (<year>2015</year>) <fpage>586</fpage>&#x02013;<lpage>600</lpage>, <pub-id pub-id-type="doi">10.1109/JSTSP.2015.2400415</pub-id><comment>.</comment></mixed-citation></ref><ref id="R30"><label>[30]</label><mixed-citation publication-type="journal"><name><surname>Papalexakis</surname><given-names>EE</given-names></name>, <name><surname>Faloutsos</surname><given-names>C</given-names></name>, <name><surname>Sidiropoulos</surname><given-names>ND</given-names></name>, <article-title>Tensors for data mining and data fusion: models, applications, and scalable algorithms</article-title>, <source>ACM Trans. Intel. Syst. Technol</source>
<volume>8</volume> (<issue>2</issue>) (<year>2017</year>) <fpage>16</fpage>, <pub-id pub-id-type="doi">10.1145/2915921</pub-id><comment>.</comment></mixed-citation></ref><ref id="R31"><label>[31]</label><mixed-citation publication-type="confproc"><name><surname>Dauwels</surname><given-names>J</given-names></name>, <name><surname>Garg</surname><given-names>L</given-names></name>, <name><surname>Earnest</surname><given-names>A</given-names></name>, <name><surname>Pang</surname><given-names>LK</given-names></name>, <source>Tensor factorization for missing data imputation in medical questionnaires</source>, <conf-name>2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)</conf-name>, <year>2012</year>, pp. <fpage>2109</fpage>&#x02013;<lpage>2112</lpage>, , <pub-id pub-id-type="doi">10.1109/ICASSP.2012.6288327</pub-id><comment>.</comment></mixed-citation></ref><ref id="R32"><label>[32]</label><mixed-citation publication-type="journal"><name><surname>Xie</surname><given-names>K</given-names></name>, <name><surname>Li</surname><given-names>X</given-names></name>, <name><surname>Wang</surname><given-names>X</given-names></name>, <name><surname>Xie</surname><given-names>G</given-names></name>, <name><surname>Wen</surname><given-names>J</given-names></name>, <name><surname>Cao</surname><given-names>J</given-names></name>, <name><surname>Zhang</surname><given-names>D</given-names></name>, <article-title>Fast tensor factorization for accurate internet anomaly detection</article-title>, <source>IEEE/ACM Trans. Network. (TON)</source>
<volume>25</volume> (<issue>6</issue>) (<year>2017</year>) <fpage>3794</fpage>&#x02013;<lpage>3807</lpage>, <pub-id pub-id-type="doi">10.1109/TNET.2017.2761704</pub-id><comment>.</comment></mixed-citation></ref><ref id="R33"><label>[33]</label><mixed-citation publication-type="confproc"><name><surname>Charlier</surname><given-names>J</given-names></name>, <name><surname>State</surname><given-names>R</given-names></name>, <name><surname>Hilger</surname><given-names>J</given-names></name>, <source>Non-negative paratuck2 tensor decomposition combined to lstm network for smart contracts profiling</source>, <conf-name>2018 IEEE International Conference on Big Data and Smart Computing (BigComp)</conf-name>, <year>2018</year>, pp. <fpage>74</fpage>&#x02013;<lpage>81</lpage>, ,<pub-id pub-id-type="doi">10.1109/BigComp.2018.00020</pub-id><comment>.</comment></mixed-citation></ref><ref id="R34"><label>[34]</label><mixed-citation publication-type="confproc"><name><surname>Xiong</surname><given-names>L</given-names></name>, <name><surname>Chen</surname><given-names>X</given-names></name>, <name><surname>Huang</surname><given-names>T-K</given-names></name>, <name><surname>Schneider</surname><given-names>J</given-names></name>, <name><surname>Carbonell</surname><given-names>JG</given-names></name>, <source>Temporal collaborative filtering with bayesian probabilistic tensor factorization</source>, <conf-name>Proceedings of the 2010 SIAM International Conference on Data Mining</conf-name>, <year>2010</year>, pp. <fpage>211</fpage>&#x02013;<lpage>222</lpage>, , <pub-id pub-id-type="doi">10.1137/1.9781611972801.19</pub-id><comment>.</comment></mixed-citation></ref><ref id="R35"><label>[35]</label><mixed-citation publication-type="journal"><name><surname>Dunlavy</surname><given-names>DM</given-names></name>, <name><surname>Kolda</surname><given-names>TG</given-names></name>, <name><surname>Acar</surname><given-names>E</given-names></name>, <etal/>, <source>ACM Trans. Knowl. Discov. Data (TKDD) 5</source> (<issue>2</issue>) (<year>2011</year>) <fpage>10</fpage>, <pub-id pub-id-type="doi">10.1145/1921632.1921636</pub-id><comment>.</comment></mixed-citation></ref><ref id="R36"><label>[36]</label><mixed-citation publication-type="journal"><name><surname>&#x0017d;itnik</surname><given-names>M</given-names></name>, <name><surname>Zupan</surname><given-names>B</given-names></name>, <article-title>Data fusion by matrix factorization</article-title>, <source>IEEE Trans. Pattern Anal. Mach. Intell</source>
<volume>37</volume> (<issue>1</issue>) (<year>2014</year>) <fpage>41</fpage>&#x02013;<lpage>53</lpage>, <pub-id pub-id-type="doi">10.1109/TPAMI.2014.2343973</pub-id><comment>.</comment></mixed-citation></ref><ref id="R37"><label>[37]</label><mixed-citation publication-type="journal"><name><surname>Kolda</surname><given-names>TG</given-names></name>, <name><surname>Bader</surname><given-names>BW</given-names></name>, <article-title>Tensor decompositions and applications</article-title>, <source>SIAM Rev</source>. <volume>51</volume> (<issue>3</issue>) (<year>2009</year>) <fpage>455</fpage>&#x02013;<lpage>500</lpage>, <pub-id pub-id-type="doi">10.1137/07070111X</pub-id><comment>.</comment></mixed-citation></ref><ref id="R38"><label>[38]</label><mixed-citation publication-type="journal"><name><surname>Vervliet</surname><given-names>N</given-names></name>, <name><surname>Debals</surname><given-names>O</given-names></name>, <name><surname>Sorber</surname><given-names>L</given-names></name>, <name><surname>De Lathauwer</surname><given-names>L</given-names></name>, <article-title>Breaking the curse of dimensionality using decompositions of incomplete tensors: tensor-based scientific computing in big data analysis</article-title>, <source>IEEE Signal Process. Mag</source>
<volume>31</volume> (<issue>5</issue>) (<year>2014</year>) <fpage>71</fpage>&#x02013;<lpage>79</lpage>, <pub-id pub-id-type="doi">10.1109/MSP.2014.2329429</pub-id><comment>.</comment></mixed-citation></ref><ref id="R39"><label>[39]</label><mixed-citation publication-type="journal"><name><surname>Acar</surname><given-names>E</given-names></name>, <name><surname>Dunlavy</surname><given-names>DM</given-names></name>, <name><surname>Kolda</surname><given-names>TG</given-names></name>, <name><surname>M&#x000f8;rup</surname><given-names>M</given-names></name>, <article-title>Scalable tensor factorizations for incomplete data</article-title>, <source>Chemom. Intell. Lab. Syst</source>
<volume>106</volume> (<issue>1</issue>) (<year>2011</year>) <fpage>41</fpage>&#x02013;<lpage>56</lpage>, <pub-id pub-id-type="doi">10.1016/j.chemolab.2010.08.004</pub-id><comment>.</comment></mixed-citation></ref><ref id="R40"><label>[40]</label><mixed-citation publication-type="journal"><name><surname>Asif</surname><given-names>MT</given-names></name>, <name><surname>Mitrovic</surname><given-names>N</given-names></name>, <name><surname>Dauwels</surname><given-names>J</given-names></name>, <name><surname>Jaillet</surname><given-names>P</given-names></name>, <article-title>Matrix and tensor based methods for missing data estimation in large traffic networks</article-title>, <source>IEEE Trans. Intell. Transp. Syst</source>
<volume>17</volume> (<issue>7</issue>) (<year>2016</year>) <fpage>1816</fpage>&#x02013;<lpage>1825</lpage>, <pub-id pub-id-type="doi">10.1109/TITS.2015.2507259</pub-id><comment>.</comment></mixed-citation></ref><ref id="R41"><label>[41]</label><mixed-citation publication-type="journal"><name><surname>Kolda</surname><given-names>TG</given-names></name>, <name><surname>Sun</surname><given-names>J</given-names></name>, <article-title>Scalable tensor decompositions for multi-aspect data mining, 2008 Eighth IEEE International Conference on Data Mining</article-title>, <source>IEEE</source>, <year>2008</year>, pp. <fpage>363</fpage>&#x02013;<lpage>372</lpage>, , <pub-id pub-id-type="doi">10.1109/ICDM.2008.89</pub-id><comment>.</comment></mixed-citation></ref><ref id="R42"><label>[42]</label><mixed-citation publication-type="journal"><name><surname>Breloff</surname><given-names>SP</given-names></name>, <name><surname>Dutta</surname><given-names>A</given-names></name>, <name><surname>Dai</surname><given-names>F</given-names></name>, <name><surname>Sinsel</surname><given-names>EW</given-names></name>, <name><surname>Warren</surname><given-names>CM</given-names></name>, <name><surname>Ning</surname><given-names>X</given-names></name>, <name><surname>Wu</surname><given-names>JZ</given-names></name>, <article-title>Assessing work-related risk factors for musculoskeletal knee disorders in construction roofing tasks</article-title>, <source>Appl. Ergon</source>
<volume>81</volume> (<year>2019</year>) <fpage>102901</fpage>, <pub-id pub-id-type="doi">10.1016/j.apergo.2019.102901</pub-id><comment>.</comment><pub-id pub-id-type="pmid">31422268</pub-id></mixed-citation></ref><ref id="R43"><label>[43]</label><mixed-citation publication-type="journal"><name><surname>Dutta</surname><given-names>A</given-names></name>, <name><surname>Breloff</surname><given-names>SP</given-names></name>, <name><surname>Dai</surname><given-names>F</given-names></name>, <name><surname>Sinsel</surname><given-names>EW</given-names></name>, <name><surname>Warren</surname><given-names>CM</given-names></name>, <name><surname>Carey</surname><given-names>RE</given-names></name>, <name><surname>Wu</surname><given-names>JZ</given-names></name>, <article-title>Effects of Working Posture and Roof Slope on Activation of Lower Limb Muscles during Shingle Installation</article-title>, <source>Ergonomics</source>, <volume>1-12</volume> (<year>2020</year>), pp. <fpage>1</fpage>&#x02013;<lpage>7</lpage>, <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1080/00140139.2020.1772378">https://doi.org/10.1080/00140139.2020.1772378</ext-link><ext-link ext-link-type="uri" xlink:href="https://www.tandfonline.com/doi/abs/10.1080/00140139.2020.1772378?journalCode=terg20">https://www.tandfonline.com/doi/abs/10.1080/00140139.2020.1772378?journalCode=terg20</ext-link><comment>https://www.tandfonline.com/doi/abs/10.1080/00140139.2020.1772378?journalCode=terg20 .</comment></mixed-citation></ref><ref id="R44"><label>[44]</label><mixed-citation publication-type="journal"><name><surname>Kingston</surname><given-names>DC</given-names></name>, <name><surname>Tennant</surname><given-names>LM</given-names></name>, <name><surname>Chong</surname><given-names>HC</given-names></name>, <name><surname>Acker</surname><given-names>SM</given-names></name>, <article-title>Peak activation of lower limb musculature during high flexion kneeling and transitional movements</article-title>, <source>Ergonomics</source>
<volume>59</volume> (<issue>9</issue>) (<year>2016</year>) <fpage>1215</fpage>&#x02013;<lpage>1223</lpage>, <pub-id pub-id-type="doi">10.1080/00140139.2015.1130861</pub-id><comment>.</comment><pub-id pub-id-type="pmid">26923936</pub-id></mixed-citation></ref><ref id="R45"><label>[45]</label><mixed-citation publication-type="journal"><name><surname>Hofer</surname><given-names>JK</given-names></name>, <name><surname>Gejo</surname><given-names>R</given-names></name>, <name><surname>McGarry</surname><given-names>MH</given-names></name>, <name><surname>Lee</surname><given-names>TQ</given-names></name>, <article-title>Effects on tibiofemoral biomechanics from kneeling</article-title>, <source>Clin. Biomech</source>
<volume>26</volume> (<issue>6</issue>) (<year>2011</year>) <fpage>605</fpage>&#x02013;<lpage>611</lpage>, <pub-id pub-id-type="doi">10.1016/j.clinbiomech.2011.01.016</pub-id><comment>.</comment></mixed-citation></ref><ref id="R46"><label>[46]</label><mixed-citation publication-type="journal"><name><surname>Pavlidis</surname><given-names>P</given-names></name>, <name><surname>Weston</surname><given-names>J</given-names></name>, <name><surname>Cai</surname><given-names>J</given-names></name>, <name><surname>Noble</surname><given-names>WS</given-names></name>, <article-title>Learning gene functional classifications from multiple data types</article-title>, <source>J. Comput. Biol</source>
<volume>9</volume> (<issue>2</issue>) (<year>2002</year>) <fpage>401</fpage>&#x02013;<lpage>411</lpage>, <pub-id pub-id-type="doi">10.1089/10665270252935539</pub-id><comment>.</comment><pub-id pub-id-type="pmid">12015889</pub-id></mixed-citation></ref><ref id="R47"><label>[47]</label><mixed-citation publication-type="journal"><name><surname>Bader</surname><given-names>B</given-names></name>, <name><surname>Kolda</surname><given-names>T</given-names></name>, <source>MATLAB tensor toolbox version 2.4</source>,<comment>&#x0003c;<ext-link ext-link-type="uri" xlink:href="https://www.sandia.gov/~tgkolda/TensorToolbox/index-2.6.html">https://www.sandia.gov/~tgkolda/TensorToolbox/index-2.6.html</ext-link>&#x0003e;</comment>(<comment>May 14</comment>, <year>2019</year>).</mixed-citation></ref><ref id="R48"><label>[48]</label><mixed-citation publication-type="journal"><name><surname>Bro</surname><given-names>R</given-names></name>, <name><surname>Kiers</surname><given-names>HA</given-names></name>, <article-title>A new efficient method for determining the number of components in PARAFAC models</article-title>, <source>J. Chemom</source>
<volume>17</volume> (<issue>5</issue>) (<year>2003</year>) <fpage>274</fpage>&#x02013;<lpage>286</lpage>, <pub-id pub-id-type="doi">10.1002/cem.801</pub-id><comment>.</comment></mixed-citation></ref><ref id="R49"><label>[49]</label><mixed-citation publication-type="journal"><name><surname>Andersson</surname><given-names>CA</given-names></name>, <name><surname>Bro</surname><given-names>R</given-names></name>, <article-title>The N-way toolbox for MATLAB</article-title>, <source>Chemom. Intell. Lab. Syst</source>
<volume>52</volume> (<issue>1</issue>) (<year>2000</year>) <fpage>1</fpage>&#x02013;<lpage>4</lpage>, <pub-id pub-id-type="doi">10.1016/S0169-7439(00)00071-X</pub-id><comment>.</comment></mixed-citation></ref><ref id="R50"><label>[50]</label><mixed-citation publication-type="journal"><name><surname>Debals</surname><given-names>O</given-names></name>, <name><surname>Van Eeghem</surname><given-names>F</given-names></name>, <name><surname>Vervliet</surname><given-names>N</given-names></name>, <name><surname>De Lathauwer</surname><given-names>L</given-names></name>, <source>Tensor computations using Tensorlab</source>, <comment><ext-link ext-link-type="uri" xlink:href="https://www.tensorlab.net/demos/tutorial.pdf">https://www.tensorlab.net/demos/tutorial.pdf</ext-link>&#x0003e;</comment>, (<comment>May 19</comment>, <year>2019</year>).</mixed-citation></ref><ref id="R51"><label>[51]</label><mixed-citation publication-type="journal"><name><surname>Alimuddin</surname><given-names>I</given-names></name>, <name><surname>Sumantyo</surname><given-names>JTS</given-names></name>, <name><surname>Kuze</surname><given-names>H</given-names></name>, <article-title>Assessment of pan-sharpening methods applied to image fusion of remotely sensed multi-band data</article-title>, <source>Int. J. Appl. Earth Obs. Geoinf</source>
<volume>18</volume> (<year>2012</year>) <fpage>165</fpage>&#x02013;<lpage>175</lpage>, <pub-id pub-id-type="doi">10.1016/j.jag.2012.01.013</pub-id><comment>.</comment></mixed-citation></ref><ref id="R52"><label>[52]</label><mixed-citation publication-type="journal"><name><surname>Ng</surname><given-names>KB</given-names></name>, <name><surname>Kantor</surname><given-names>PB</given-names></name>, <article-title>Predicting the effectiveness of naive data fusion on the basis of system characteristics</article-title>, <source>J. Am. Soc. Inf. Sci</source>
<volume>51</volume> (<issue>13</issue>) (<year>2000</year>) <fpage>1177</fpage>&#x02013;<lpage>1189</lpage>, <pub-id pub-id-type="doi">10.1002/1097-4571(2000)9999:9999&#x0003c;::AID-ASI1030&#x0003e;3.0.CO;2-E</pub-id><comment>.</comment></mixed-citation></ref><ref id="R53"><label>[53]</label><mixed-citation publication-type="journal"><name><surname>Alexander</surname><given-names>DL</given-names></name>, <name><surname>Tropsha</surname><given-names>A</given-names></name>, <name><surname>Winkler</surname><given-names>DA</given-names></name>, <article-title>Beware of R 2: simple, unambiguous assessment of the prediction accuracy of QSAR and QSPR models</article-title>, <source>J. Chem. Inf. Model</source>
<volume>55</volume> (<issue>7</issue>) (<year>2015</year>) <fpage>1316</fpage>&#x02013;<lpage>1322</lpage>, <pub-id pub-id-type="doi">10.1021/acs.jcim.5b00206</pub-id><comment>.</comment><pub-id pub-id-type="pmid">26099013</pub-id></mixed-citation></ref><ref id="R54"><label>[54]</label><mixed-citation publication-type="journal"><name><surname>Khaleghi</surname><given-names>B</given-names></name>, <name><surname>Khamis</surname><given-names>A</given-names></name>, <name><surname>Karray</surname><given-names>FO</given-names></name>, <name><surname>Razavi</surname><given-names>SN</given-names></name>, <article-title>Multisensor data fusion: a review of the state-of-the-art</article-title>, <source>Inform. Fusion</source>
<volume>14</volume> (<issue>1</issue>) (<year>2013</year>) <fpage>28</fpage>&#x02013;<lpage>44</lpage>, <pub-id pub-id-type="doi">10.1016/j.inffus.2011.08.001</pub-id><comment>.</comment></mixed-citation></ref></ref-list></back><floats-group><fig id="F1" orientation="portrait" position="float"><label>Fig. 1.</label><caption><p id="P71">CPD tensor decomposition.</p></caption><graphic xlink:href="nihms-1653730-f0001"/></fig><fig id="F2" orientation="portrait" position="float"><label>Fig. 2.</label><caption><p id="P72">Overview of proposed method.</p></caption><graphic xlink:href="nihms-1653730-f0002"/></fig><fig id="F3" orientation="portrait" position="float"><label>Fig. 3.</label><caption><p id="P73">Schematic representation of DS3 as a 3D tensor <italic><bold>M</bold></italic>.</p></caption><graphic xlink:href="nihms-1653730-f0003"/></fig><fig id="F4" orientation="portrait" position="float"><label>Fig. 4.</label><caption><p id="P74">Schematic illustration of CPD decomposition of the 3D tensor <italic><bold>M</bold></italic>.</p></caption><graphic xlink:href="nihms-1653730-f0004"/></fig><fig id="F5" orientation="portrait" position="float"><label>Fig. 5.</label><caption><p id="P75">Representation of the five-dimensional tensor <italic><bold>T</bold></italic>.</p></caption><graphic xlink:href="nihms-1653730-f0005"/></fig><fig id="F6" orientation="portrait" position="float"><label>Fig. 6.</label><caption><p id="P76">CPD tensor decomposition of tensor <italic><bold>T</bold></italic>.</p></caption><graphic xlink:href="nihms-1653730-f0006"/></fig><fig id="F7" orientation="portrait" position="float"><label>Fig. 7.</label><caption><p id="P77">RMSE curves for 10%, 20%, 30%, 40%, 50%, 60% and 70% missing data from rotation dataset.</p></caption><graphic xlink:href="nihms-1653730-f0007"/></fig><fig id="F8" orientation="portrait" position="float"><label>Fig. 8.</label><caption><p id="P78">MAE curves for 10%, 20%, 30%, 40%, 50%, 60% and 70% missing data from rotation dataset.</p></caption><graphic xlink:href="nihms-1653730-f0008"/></fig><fig id="F9" orientation="portrait" position="float"><label>Fig. 9.</label><caption><p id="P79">Consistency curves for 10%, 20%, 30%, 40%, 50%, 60% and 70% missing data from rotation DS.</p></caption><graphic xlink:href="nihms-1653730-f0009"/></fig><table-wrap id="T1" position="float" orientation="portrait"><label>Table 1</label><caption><p id="P80">Formats of DS1 and DS2.</p></caption><table frame="hsides" rules="groups"><colgroup span="1"><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/></colgroup><thead><tr><th align="left" valign="middle" rowspan="1" colspan="1">Dataset (DS)</th><th align="left" valign="middle" rowspan="1" colspan="1">Tabular format</th></tr></thead><tbody><tr><td align="left" valign="middle" rowspan="1" colspan="1">DS1</td><td align="left" valign="middle" rowspan="1" colspan="1"><italic>subject</italic>, <italic>setting</italic><sub>1</sub>, <italic>setting</italic><sub>2</sub>, &#x02026;, <italic>setting</italic><sub><italic>q</italic></sub>, <italic>ind</italic><sub>1</sub><sup>1</sup>, <italic>ind</italic><sub>1</sub><sup>2</sup>, &#x02026;, <italic>ind</italic><sub>1</sub><sup><italic>m</italic></sup></td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">DS2</td><td align="left" valign="middle" rowspan="1" colspan="1"><italic>subject</italic>, <italic>setting</italic><sub>1</sub>, <italic>setting</italic><sub>2</sub>, &#x02026;, <italic>setting</italic><sub><italic>q</italic></sub>, <italic>ind</italic><sub>2</sub><sup>1</sup>, <italic>ind</italic><sub>2</sub><sup>2</sup>, &#x02026;, <italic>ind</italic><sub>1</sub><sup>n</sup></td></tr></tbody></table></table-wrap><table-wrap id="T2" position="float" orientation="portrait"><label>Table 2</label><caption><p id="P81">Format of DS3.</p></caption><table frame="hsides" rules="groups"><colgroup span="1"><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/></colgroup><thead><tr><th align="left" valign="middle" rowspan="1" colspan="1">Dataset (DS)</th><th align="left" valign="middle" rowspan="1" colspan="1">Format</th></tr></thead><tbody><tr><td align="left" valign="middle" rowspan="1" colspan="1">DS3</td><td align="left" valign="middle" rowspan="1" colspan="1"><italic>subject</italic>, <italic>setting</italic><sub>1</sub>, <italic>setting</italic><sub>2</sub>, &#x02026;, <italic>setting</italic><sub><italic>q</italic></sub>, <italic>ind</italic><sub>1</sub><sup>1</sup>, &#x02026;, <italic>ind</italic><sub>1</sub><sup><italic>m</italic></sup>, <italic>ind</italic><sub>2</sub><sup>1</sup>, &#x02026;, <italic>ind</italic><sub>2</sub><sup><italic>n</italic></sup></td></tr></tbody></table></table-wrap><table-wrap id="T3" position="float" orientation="portrait"><label>Table 3</label><caption><p id="P82">Proportion of missing data.</p></caption><table frame="hsides" rules="groups"><colgroup span="1"><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/></colgroup><thead><tr><th align="left" valign="middle" rowspan="1" colspan="1">Datasets</th><th colspan="7" align="left" valign="middle" rowspan="1">Missing proportion (%)</th></tr></thead><tbody><tr><td align="left" valign="middle" rowspan="1" colspan="1">Knee kinematics data</td><td align="left" valign="middle" rowspan="1" colspan="1">10</td><td align="left" valign="middle" rowspan="1" colspan="1">20</td><td align="left" valign="middle" rowspan="1" colspan="1">30</td><td align="left" valign="middle" rowspan="1" colspan="1">40</td><td align="left" valign="middle" rowspan="1" colspan="1">50</td><td align="left" valign="middle" rowspan="1" colspan="1">60</td><td align="left" valign="middle" rowspan="1" colspan="1">70</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">Knee EMG data</td><td align="left" valign="middle" rowspan="1" colspan="1">10</td><td align="left" valign="middle" rowspan="1" colspan="1">20</td><td align="left" valign="middle" rowspan="1" colspan="1">30</td><td align="left" valign="middle" rowspan="1" colspan="1">40</td><td align="left" valign="middle" rowspan="1" colspan="1">50</td><td align="left" valign="middle" rowspan="1" colspan="1">60</td><td align="left" valign="middle" rowspan="1" colspan="1">70</td></tr></tbody></table></table-wrap><table-wrap id="T4" position="float" orientation="portrait"><label>Table 4</label><caption><p id="P83">Incomplete datasets for tensor construction.</p></caption><table frame="hsides" rules="groups"><colgroup span="1"><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/></colgroup><thead><tr><th align="left" valign="middle" rowspan="1" colspan="1">Missing proportion from kinematics data (%)</th><th colspan="7" align="left" valign="top" rowspan="1">Missing proportion from EMG data (%)</th></tr></thead><tbody><tr><td align="left" valign="middle" rowspan="1" colspan="1">10</td><td align="left" valign="middle" rowspan="1" colspan="1">10</td><td align="left" valign="middle" rowspan="1" colspan="1">20</td><td align="left" valign="middle" rowspan="1" colspan="1">30</td><td align="left" valign="middle" rowspan="1" colspan="1">40</td><td align="left" valign="middle" rowspan="1" colspan="1">50</td><td align="left" valign="middle" rowspan="1" colspan="1">60</td><td align="left" valign="middle" rowspan="1" colspan="1">70</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">20</td><td align="left" valign="middle" rowspan="1" colspan="1">10</td><td align="left" valign="middle" rowspan="1" colspan="1">20</td><td align="left" valign="middle" rowspan="1" colspan="1">30</td><td align="left" valign="middle" rowspan="1" colspan="1">40</td><td align="left" valign="middle" rowspan="1" colspan="1">50</td><td align="left" valign="middle" rowspan="1" colspan="1">60</td><td align="left" valign="middle" rowspan="1" colspan="1">70</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">30</td><td align="left" valign="middle" rowspan="1" colspan="1">10</td><td align="left" valign="middle" rowspan="1" colspan="1">20</td><td align="left" valign="middle" rowspan="1" colspan="1">30</td><td align="left" valign="middle" rowspan="1" colspan="1">40</td><td align="left" valign="middle" rowspan="1" colspan="1">50</td><td align="left" valign="middle" rowspan="1" colspan="1">60</td><td align="left" valign="middle" rowspan="1" colspan="1">70</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">40</td><td align="left" valign="middle" rowspan="1" colspan="1">10</td><td align="left" valign="middle" rowspan="1" colspan="1">20</td><td align="left" valign="middle" rowspan="1" colspan="1">30</td><td align="left" valign="middle" rowspan="1" colspan="1">40</td><td align="left" valign="middle" rowspan="1" colspan="1">50</td><td align="left" valign="middle" rowspan="1" colspan="1">60</td><td align="left" valign="middle" rowspan="1" colspan="1">70</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">50</td><td align="left" valign="middle" rowspan="1" colspan="1">10</td><td align="left" valign="middle" rowspan="1" colspan="1">20</td><td align="left" valign="middle" rowspan="1" colspan="1">30</td><td align="left" valign="middle" rowspan="1" colspan="1">40</td><td align="left" valign="middle" rowspan="1" colspan="1">50</td><td align="left" valign="middle" rowspan="1" colspan="1">60</td><td align="left" valign="middle" rowspan="1" colspan="1">70</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">60</td><td align="left" valign="middle" rowspan="1" colspan="1">10</td><td align="left" valign="middle" rowspan="1" colspan="1">20</td><td align="left" valign="middle" rowspan="1" colspan="1">30</td><td align="left" valign="middle" rowspan="1" colspan="1">40</td><td align="left" valign="middle" rowspan="1" colspan="1">50</td><td align="left" valign="middle" rowspan="1" colspan="1">60</td><td align="left" valign="middle" rowspan="1" colspan="1">70</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">70</td><td align="left" valign="middle" rowspan="1" colspan="1">10</td><td align="left" valign="middle" rowspan="1" colspan="1">20</td><td align="left" valign="middle" rowspan="1" colspan="1">30</td><td align="left" valign="middle" rowspan="1" colspan="1">40</td><td align="left" valign="middle" rowspan="1" colspan="1">50</td><td align="left" valign="middle" rowspan="1" colspan="1">60</td><td align="left" valign="middle" rowspan="1" colspan="1">70</td></tr></tbody></table></table-wrap><table-wrap id="T5" position="float" orientation="portrait"><label>Table 5</label><caption><p id="P84">Summary of results.</p></caption><table frame="hsides" rules="groups"><colgroup span="1"><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/><col align="left" valign="middle" span="1"/></colgroup><thead><tr><th colspan="2" align="left" valign="middle" rowspan="1">Missing proportion (%)<hr/></th><th align="left" valign="top" rowspan="1" colspan="1">RMSE</th><th align="left" valign="top" rowspan="1" colspan="1">MAE</th><th align="left" valign="top" rowspan="1" colspan="1">Con (%)</th><th align="left" valign="top" rowspan="1" colspan="1"><italic>R</italic></th><th align="left" valign="top" rowspan="1" colspan="1">Core consistency (%)</th></tr><tr><th align="left" valign="middle" rowspan="1" colspan="1">Knee rotation</th><th align="left" valign="middle" rowspan="1" colspan="1">EMG</th><th align="left" valign="middle" rowspan="1" colspan="1"/><th align="left" valign="middle" rowspan="1" colspan="1"/><th align="left" valign="middle" rowspan="1" colspan="1"/><th align="left" valign="middle" rowspan="1" colspan="1"/><th align="left" valign="middle" rowspan="1" colspan="1"/></tr></thead><tbody><tr><td align="left" valign="middle" rowspan="1" colspan="1">10</td><td align="left" valign="middle" rowspan="1" colspan="1">10</td><td align="left" valign="middle" rowspan="1" colspan="1">1.39</td><td align="left" valign="middle" rowspan="1" colspan="1">0.69</td><td align="left" valign="middle" rowspan="1" colspan="1">80</td><td align="left" valign="middle" rowspan="1" colspan="1">17</td><td align="left" valign="middle" rowspan="1" colspan="1">98</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">10</td><td align="left" valign="middle" rowspan="1" colspan="1">20</td><td align="left" valign="middle" rowspan="1" colspan="1">1.38</td><td align="left" valign="middle" rowspan="1" colspan="1">0.8192</td><td align="left" valign="middle" rowspan="1" colspan="1">80</td><td align="left" valign="middle" rowspan="1" colspan="1">14</td><td align="left" valign="middle" rowspan="1" colspan="1">96.5</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">10</td><td align="left" valign="middle" rowspan="1" colspan="1">30</td><td align="left" valign="middle" rowspan="1" colspan="1">1.39</td><td align="left" valign="middle" rowspan="1" colspan="1">0.818</td><td align="left" valign="middle" rowspan="1" colspan="1">80</td><td align="left" valign="middle" rowspan="1" colspan="1">15</td><td align="left" valign="middle" rowspan="1" colspan="1">97</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">10</td><td align="left" valign="middle" rowspan="1" colspan="1">40</td><td align="left" valign="middle" rowspan="1" colspan="1">1.4</td><td align="left" valign="middle" rowspan="1" colspan="1">0.8412</td><td align="left" valign="middle" rowspan="1" colspan="1">80</td><td align="left" valign="middle" rowspan="1" colspan="1">14</td><td align="left" valign="middle" rowspan="1" colspan="1">96</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">10</td><td align="left" valign="middle" rowspan="1" colspan="1">50</td><td align="left" valign="middle" rowspan="1" colspan="1">1.44</td><td align="left" valign="middle" rowspan="1" colspan="1">0.805</td><td align="left" valign="middle" rowspan="1" colspan="1">77</td><td align="left" valign="middle" rowspan="1" colspan="1">14</td><td align="left" valign="middle" rowspan="1" colspan="1">96.5</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">10</td><td align="left" valign="middle" rowspan="1" colspan="1">60</td><td align="left" valign="middle" rowspan="1" colspan="1">1.57</td><td align="left" valign="middle" rowspan="1" colspan="1">0.9597</td><td align="left" valign="middle" rowspan="1" colspan="1">77</td><td align="left" valign="middle" rowspan="1" colspan="1">12</td><td align="left" valign="middle" rowspan="1" colspan="1">95</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">10</td><td align="left" valign="middle" rowspan="1" colspan="1">70</td><td align="left" valign="middle" rowspan="1" colspan="1">2.34</td><td align="left" valign="middle" rowspan="1" colspan="1">1.47</td><td align="left" valign="middle" rowspan="1" colspan="1">74</td><td align="left" valign="middle" rowspan="1" colspan="1">10</td><td align="left" valign="middle" rowspan="1" colspan="1">94.67</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">20</td><td align="left" valign="middle" rowspan="1" colspan="1">10</td><td align="left" valign="middle" rowspan="1" colspan="1">1.67</td><td align="left" valign="middle" rowspan="1" colspan="1">0.9791</td><td align="left" valign="middle" rowspan="1" colspan="1">80</td><td align="left" valign="middle" rowspan="1" colspan="1">12</td><td align="left" valign="middle" rowspan="1" colspan="1">97</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">20</td><td align="left" valign="middle" rowspan="1" colspan="1">20</td><td align="left" valign="middle" rowspan="1" colspan="1">1.73</td><td align="left" valign="middle" rowspan="1" colspan="1">1.04</td><td align="left" valign="middle" rowspan="1" colspan="1">80</td><td align="left" valign="middle" rowspan="1" colspan="1">11</td><td align="left" valign="middle" rowspan="1" colspan="1">97</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">20</td><td align="left" valign="middle" rowspan="1" colspan="1">30</td><td align="left" valign="middle" rowspan="1" colspan="1">1.78</td><td align="left" valign="middle" rowspan="1" colspan="1">1.04</td><td align="left" valign="middle" rowspan="1" colspan="1">80</td><td align="left" valign="middle" rowspan="1" colspan="1">11</td><td align="left" valign="middle" rowspan="1" colspan="1">95.6</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">20</td><td align="left" valign="middle" rowspan="1" colspan="1">40</td><td align="left" valign="middle" rowspan="1" colspan="1">1.75</td><td align="left" valign="middle" rowspan="1" colspan="1">1.06</td><td align="left" valign="middle" rowspan="1" colspan="1">77</td><td align="left" valign="middle" rowspan="1" colspan="1">12</td><td align="left" valign="middle" rowspan="1" colspan="1">94.8</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">20</td><td align="left" valign="middle" rowspan="1" colspan="1">50</td><td align="left" valign="middle" rowspan="1" colspan="1">1.79</td><td align="left" valign="middle" rowspan="1" colspan="1">1.08</td><td align="left" valign="middle" rowspan="1" colspan="1">74</td><td align="left" valign="middle" rowspan="1" colspan="1">13</td><td align="left" valign="middle" rowspan="1" colspan="1">94</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">20</td><td align="left" valign="middle" rowspan="1" colspan="1">60</td><td align="left" valign="middle" rowspan="1" colspan="1">1.83</td><td align="left" valign="middle" rowspan="1" colspan="1">1.09</td><td align="left" valign="middle" rowspan="1" colspan="1">74</td><td align="left" valign="middle" rowspan="1" colspan="1">13</td><td align="left" valign="middle" rowspan="1" colspan="1">93</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">20</td><td align="left" valign="middle" rowspan="1" colspan="1">70</td><td align="left" valign="middle" rowspan="1" colspan="1">2.45</td><td align="left" valign="middle" rowspan="1" colspan="1">1.25</td><td align="left" valign="middle" rowspan="1" colspan="1">70</td><td align="left" valign="middle" rowspan="1" colspan="1">10</td><td align="left" valign="middle" rowspan="1" colspan="1">92.7</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">30</td><td align="left" valign="middle" rowspan="1" colspan="1">10</td><td align="left" valign="middle" rowspan="1" colspan="1">1.82</td><td align="left" valign="middle" rowspan="1" colspan="1">1</td><td align="left" valign="middle" rowspan="1" colspan="1">77</td><td align="left" valign="middle" rowspan="1" colspan="1">13</td><td align="left" valign="middle" rowspan="1" colspan="1">97</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">30</td><td align="left" valign="middle" rowspan="1" colspan="1">20</td><td align="left" valign="middle" rowspan="1" colspan="1">2</td><td align="left" valign="middle" rowspan="1" colspan="1">1.14</td><td align="left" valign="middle" rowspan="1" colspan="1">74</td><td align="left" valign="middle" rowspan="1" colspan="1">11</td><td align="left" valign="middle" rowspan="1" colspan="1">95.6</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">30</td><td align="left" valign="middle" rowspan="1" colspan="1">30</td><td align="left" valign="middle" rowspan="1" colspan="1">2.14</td><td align="left" valign="middle" rowspan="1" colspan="1">1.17</td><td align="left" valign="middle" rowspan="1" colspan="1">74</td><td align="left" valign="middle" rowspan="1" colspan="1">11</td><td align="left" valign="middle" rowspan="1" colspan="1">96</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">30</td><td align="left" valign="middle" rowspan="1" colspan="1">40</td><td align="left" valign="middle" rowspan="1" colspan="1">2.19</td><td align="left" valign="middle" rowspan="1" colspan="1">1.25</td><td align="left" valign="middle" rowspan="1" colspan="1">74</td><td align="left" valign="middle" rowspan="1" colspan="1">10</td><td align="left" valign="middle" rowspan="1" colspan="1">95</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">30</td><td align="left" valign="middle" rowspan="1" colspan="1">50</td><td align="left" valign="middle" rowspan="1" colspan="1">2.26</td><td align="left" valign="middle" rowspan="1" colspan="1">1.29</td><td align="left" valign="middle" rowspan="1" colspan="1">74</td><td align="left" valign="middle" rowspan="1" colspan="1">10</td><td align="left" valign="middle" rowspan="1" colspan="1">95.7</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">30</td><td align="left" valign="middle" rowspan="1" colspan="1">60</td><td align="left" valign="middle" rowspan="1" colspan="1">2.32</td><td align="left" valign="middle" rowspan="1" colspan="1">1.24</td><td align="left" valign="middle" rowspan="1" colspan="1">74</td><td align="left" valign="middle" rowspan="1" colspan="1">10</td><td align="left" valign="middle" rowspan="1" colspan="1">93.5</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">30</td><td align="left" valign="middle" rowspan="1" colspan="1">70</td><td align="left" valign="middle" rowspan="1" colspan="1">5.6</td><td align="left" valign="middle" rowspan="1" colspan="1">3.4</td><td align="left" valign="middle" rowspan="1" colspan="1">57</td><td align="left" valign="middle" rowspan="1" colspan="1">11</td><td align="left" valign="middle" rowspan="1" colspan="1">94</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">40</td><td align="left" valign="middle" rowspan="1" colspan="1">10</td><td align="left" valign="middle" rowspan="1" colspan="1">2.11</td><td align="left" valign="middle" rowspan="1" colspan="1">1.11</td><td align="left" valign="middle" rowspan="1" colspan="1">87</td><td align="left" valign="middle" rowspan="1" colspan="1">12</td><td align="left" valign="middle" rowspan="1" colspan="1">96.5</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">40</td><td align="left" valign="middle" rowspan="1" colspan="1">20</td><td align="left" valign="middle" rowspan="1" colspan="1">2.95</td><td align="left" valign="middle" rowspan="1" colspan="1">1.32</td><td align="left" valign="middle" rowspan="1" colspan="1">74</td><td align="left" valign="middle" rowspan="1" colspan="1">10</td><td align="left" valign="middle" rowspan="1" colspan="1">94</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">40</td><td align="left" valign="middle" rowspan="1" colspan="1">30</td><td align="left" valign="middle" rowspan="1" colspan="1">2.98</td><td align="left" valign="middle" rowspan="1" colspan="1">1.31</td><td align="left" valign="middle" rowspan="1" colspan="1">74</td><td align="left" valign="middle" rowspan="1" colspan="1">10</td><td align="left" valign="middle" rowspan="1" colspan="1">95</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">40</td><td align="left" valign="middle" rowspan="1" colspan="1">40</td><td align="left" valign="middle" rowspan="1" colspan="1">2.52</td><td align="left" valign="middle" rowspan="1" colspan="1">1.24</td><td align="left" valign="middle" rowspan="1" colspan="1">74</td><td align="left" valign="middle" rowspan="1" colspan="1">12</td><td align="left" valign="middle" rowspan="1" colspan="1">93.7</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">40</td><td align="left" valign="middle" rowspan="1" colspan="1">50</td><td align="left" valign="middle" rowspan="1" colspan="1">2.83</td><td align="left" valign="middle" rowspan="1" colspan="1">1.3</td><td align="left" valign="middle" rowspan="1" colspan="1">74</td><td align="left" valign="middle" rowspan="1" colspan="1">12</td><td align="left" valign="middle" rowspan="1" colspan="1">92.76</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">40</td><td align="left" valign="middle" rowspan="1" colspan="1">60</td><td align="left" valign="middle" rowspan="1" colspan="1">2.7</td><td align="left" valign="middle" rowspan="1" colspan="1">1.39</td><td align="left" valign="middle" rowspan="1" colspan="1">74</td><td align="left" valign="middle" rowspan="1" colspan="1">11</td><td align="left" valign="middle" rowspan="1" colspan="1">93</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">40</td><td align="left" valign="middle" rowspan="1" colspan="1">70</td><td align="left" valign="middle" rowspan="1" colspan="1">2.77</td><td align="left" valign="middle" rowspan="1" colspan="1">1.51</td><td align="left" valign="middle" rowspan="1" colspan="1">70</td><td align="left" valign="middle" rowspan="1" colspan="1">13</td><td align="left" valign="middle" rowspan="1" colspan="1">92</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">50</td><td align="left" valign="middle" rowspan="1" colspan="1">10</td><td align="left" valign="middle" rowspan="1" colspan="1">4.9</td><td align="left" valign="middle" rowspan="1" colspan="1">1.69</td><td align="left" valign="middle" rowspan="1" colspan="1">80</td><td align="left" valign="middle" rowspan="1" colspan="1">22</td><td align="left" valign="middle" rowspan="1" colspan="1">93</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">50</td><td align="left" valign="middle" rowspan="1" colspan="1">20</td><td align="left" valign="middle" rowspan="1" colspan="1">5.11</td><td align="left" valign="middle" rowspan="1" colspan="1">1.81</td><td align="left" valign="middle" rowspan="1" colspan="1">80</td><td align="left" valign="middle" rowspan="1" colspan="1">31</td><td align="left" valign="middle" rowspan="1" colspan="1">93</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">50</td><td align="left" valign="middle" rowspan="1" colspan="1">30</td><td align="left" valign="middle" rowspan="1" colspan="1">5.28</td><td align="left" valign="middle" rowspan="1" colspan="1">1.99</td><td align="left" valign="middle" rowspan="1" colspan="1">80</td><td align="left" valign="middle" rowspan="1" colspan="1">29</td><td align="left" valign="middle" rowspan="1" colspan="1">92.6</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">50</td><td align="left" valign="middle" rowspan="1" colspan="1">40</td><td align="left" valign="middle" rowspan="1" colspan="1">6.19</td><td align="left" valign="middle" rowspan="1" colspan="1">2.24</td><td align="left" valign="middle" rowspan="1" colspan="1">77</td><td align="left" valign="middle" rowspan="1" colspan="1">34</td><td align="left" valign="middle" rowspan="1" colspan="1">92</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">50</td><td align="left" valign="middle" rowspan="1" colspan="1">50</td><td align="left" valign="middle" rowspan="1" colspan="1">6</td><td align="left" valign="middle" rowspan="1" colspan="1">2.4</td><td align="left" valign="middle" rowspan="1" colspan="1">77</td><td align="left" valign="middle" rowspan="1" colspan="1">43</td><td align="left" valign="middle" rowspan="1" colspan="1">91.5</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">50</td><td align="left" valign="middle" rowspan="1" colspan="1">60</td><td align="left" valign="middle" rowspan="1" colspan="1">6.5</td><td align="left" valign="middle" rowspan="1" colspan="1">2.7</td><td align="left" valign="middle" rowspan="1" colspan="1">77</td><td align="left" valign="middle" rowspan="1" colspan="1">40</td><td align="left" valign="middle" rowspan="1" colspan="1">92</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">50</td><td align="left" valign="middle" rowspan="1" colspan="1">70</td><td align="left" valign="middle" rowspan="1" colspan="1">6.8</td><td align="left" valign="middle" rowspan="1" colspan="1">3.8</td><td align="left" valign="middle" rowspan="1" colspan="1">60</td><td align="left" valign="middle" rowspan="1" colspan="1">10</td><td align="left" valign="middle" rowspan="1" colspan="1">91</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">60</td><td align="left" valign="middle" rowspan="1" colspan="1">10</td><td align="left" valign="middle" rowspan="1" colspan="1">3.38</td><td align="left" valign="middle" rowspan="1" colspan="1">1.45</td><td align="left" valign="middle" rowspan="1" colspan="1">84</td><td align="left" valign="middle" rowspan="1" colspan="1">12</td><td align="left" valign="middle" rowspan="1" colspan="1">93</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">60</td><td align="left" valign="middle" rowspan="1" colspan="1">20</td><td align="left" valign="middle" rowspan="1" colspan="1">4.4</td><td align="left" valign="middle" rowspan="1" colspan="1">1.76</td><td align="left" valign="middle" rowspan="1" colspan="1">77</td><td align="left" valign="middle" rowspan="1" colspan="1">15</td><td align="left" valign="middle" rowspan="1" colspan="1">93.6</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">60</td><td align="left" valign="middle" rowspan="1" colspan="1">30</td><td align="left" valign="middle" rowspan="1" colspan="1">4.63</td><td align="left" valign="middle" rowspan="1" colspan="1">1.8</td><td align="left" valign="middle" rowspan="1" colspan="1">77</td><td align="left" valign="middle" rowspan="1" colspan="1">13</td><td align="left" valign="middle" rowspan="1" colspan="1">92</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">60</td><td align="left" valign="middle" rowspan="1" colspan="1">40</td><td align="left" valign="middle" rowspan="1" colspan="1">5</td><td align="left" valign="middle" rowspan="1" colspan="1">2</td><td align="left" valign="middle" rowspan="1" colspan="1">77</td><td align="left" valign="middle" rowspan="1" colspan="1">13</td><td align="left" valign="middle" rowspan="1" colspan="1">92.8</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">60</td><td align="left" valign="middle" rowspan="1" colspan="1">50</td><td align="left" valign="middle" rowspan="1" colspan="1">5.32</td><td align="left" valign="middle" rowspan="1" colspan="1">2.08</td><td align="left" valign="middle" rowspan="1" colspan="1">74</td><td align="left" valign="middle" rowspan="1" colspan="1">10</td><td align="left" valign="middle" rowspan="1" colspan="1">93</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">60</td><td align="left" valign="middle" rowspan="1" colspan="1">60</td><td align="left" valign="middle" rowspan="1" colspan="1">5.39</td><td align="left" valign="middle" rowspan="1" colspan="1">2.16</td><td align="left" valign="middle" rowspan="1" colspan="1">70</td><td align="left" valign="middle" rowspan="1" colspan="1">16</td><td align="left" valign="middle" rowspan="1" colspan="1">92</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">60</td><td align="left" valign="middle" rowspan="1" colspan="1">70</td><td align="left" valign="middle" rowspan="1" colspan="1">5.6</td><td align="left" valign="middle" rowspan="1" colspan="1">2.2</td><td align="left" valign="middle" rowspan="1" colspan="1">70</td><td align="left" valign="middle" rowspan="1" colspan="1">11</td><td align="left" valign="middle" rowspan="1" colspan="1">91</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">70</td><td align="left" valign="middle" rowspan="1" colspan="1">10</td><td align="left" valign="middle" rowspan="1" colspan="1">6.7</td><td align="left" valign="middle" rowspan="1" colspan="1">2.7</td><td align="left" valign="middle" rowspan="1" colspan="1">77</td><td align="left" valign="middle" rowspan="1" colspan="1">20</td><td align="left" valign="middle" rowspan="1" colspan="1">90</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">70</td><td align="left" valign="middle" rowspan="1" colspan="1">20</td><td align="left" valign="middle" rowspan="1" colspan="1">7.7</td><td align="left" valign="middle" rowspan="1" colspan="1">3.2</td><td align="left" valign="middle" rowspan="1" colspan="1">74</td><td align="left" valign="middle" rowspan="1" colspan="1">17</td><td align="left" valign="middle" rowspan="1" colspan="1">89</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">70</td><td align="left" valign="middle" rowspan="1" colspan="1">30</td><td align="left" valign="middle" rowspan="1" colspan="1">7.5</td><td align="left" valign="middle" rowspan="1" colspan="1">3</td><td align="left" valign="middle" rowspan="1" colspan="1">74</td><td align="left" valign="middle" rowspan="1" colspan="1">12</td><td align="left" valign="middle" rowspan="1" colspan="1">91.5</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">70</td><td align="left" valign="middle" rowspan="1" colspan="1">40</td><td align="left" valign="middle" rowspan="1" colspan="1">7.54</td><td align="left" valign="middle" rowspan="1" colspan="1">3.38</td><td align="left" valign="middle" rowspan="1" colspan="1">70</td><td align="left" valign="middle" rowspan="1" colspan="1">10</td><td align="left" valign="middle" rowspan="1" colspan="1">90</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">70</td><td align="left" valign="middle" rowspan="1" colspan="1">50</td><td align="left" valign="middle" rowspan="1" colspan="1">8</td><td align="left" valign="middle" rowspan="1" colspan="1">3.3</td><td align="left" valign="middle" rowspan="1" colspan="1">70</td><td align="left" valign="middle" rowspan="1" colspan="1">11</td><td align="left" valign="middle" rowspan="1" colspan="1">92.4</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">70</td><td align="left" valign="middle" rowspan="1" colspan="1">60</td><td align="left" valign="middle" rowspan="1" colspan="1">8.29</td><td align="left" valign="middle" rowspan="1" colspan="1">3.4</td><td align="left" valign="middle" rowspan="1" colspan="1">70</td><td align="left" valign="middle" rowspan="1" colspan="1">11</td><td align="left" valign="middle" rowspan="1" colspan="1">90</td></tr><tr><td align="left" valign="middle" rowspan="1" colspan="1">70</td><td align="left" valign="middle" rowspan="1" colspan="1">70</td><td align="left" valign="middle" rowspan="1" colspan="1">8.8</td><td align="left" valign="middle" rowspan="1" colspan="1">3.67</td><td align="left" valign="middle" rowspan="1" colspan="1">67</td><td align="left" valign="middle" rowspan="1" colspan="1">12</td><td align="left" valign="middle" rowspan="1" colspan="1">88</td></tr></tbody></table></table-wrap></floats-group></article>