<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Bioeng. Biotechnol.</journal-id>
<journal-title>Frontiers in Bioengineering and Biotechnology</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Bioeng. Biotechnol.</abbrev-journal-title>
<issn pub-type="epub">2296-4185</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fbioe.2015.00065</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Bioengineering and Biotechnology</subject>
<subj-group>
<subject>Technology Report</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>The Use of Collision Detection to Infer Multi-Camera Calibration Quality</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name><surname>Chong</surname> <given-names>Sook-Yee</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="corresp" rid="cor1">&#x0002A;</xref>
<uri xlink:href="http://frontiersin.org/people/u/155451"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Dorow</surname> <given-names>Beate</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Ramasamy</surname> <given-names>Ellankavi</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<uri xlink:href="http://frontiersin.org/people/u/209640"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Dennerlein</surname> <given-names>Florian</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
</contrib>
<contrib contrib-type="author">
<name><surname>R&#x000F6;hrle</surname> <given-names>Oliver</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<uri xlink:href="http://frontiersin.org/people/u/27963"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>SimTech Research Group on Continuum Biomechanics and Mechanobiology, Institute of Applied Mechanics (CE), University of Stuttgart</institution>, <addr-line>Stuttgart</addr-line>, <country>Germany</country></aff>
<aff id="aff2"><sup>2</sup><institution>Biomechatronic Systems, Fraunhofer IPA</institution>, <addr-line>Stuttgart</addr-line>, <country>Germany</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Matthew B. Panzer, University of Virginia, USA</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Xiaogai Li, Royal Institute of Technology, Sweden; Mariusz Ziejewski, North Dakota State University, USA</p></fn>
<corresp content-type="corresp" id="cor1">&#x0002A;Correspondence: Sook-Yee Chong, Institute of Applied Mechanics (CE), University of Stuttgart, Pfaffenwaldring 5a, Stuttgart 70569, Germany, <email>sy.chong&#x00040;simtech.uni-stuttgart.de</email></corresp>
<fn fn-type="other" id="fn001"><p>Specialty section: This article was submitted to Biomechanics, a section of the journal Frontiers in Bioengineering and Biotechnology</p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>12</day>
<month>05</month>
<year>2015</year>
</pub-date>
<pub-date pub-type="collection">
<year>2015</year>
</pub-date>
<volume>3</volume>
<elocation-id>65</elocation-id>
<history>
<date date-type="received">
<day>30</day>
<month>01</month>
<year>2015</year>
</date>
<date date-type="accepted">
<day>27</day>
<month>04</month>
<year>2015</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2015 Chong, Dorow, Ramasamy, Dennerlein and R&#x000F6;hrle.</copyright-statement>
<copyright-year>2015</copyright-year>
<license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract>
<p>Optical motion capture systems are widely used in sports and medicine. The performance of these systems depends on, amongst other factors, the quality of the camera calibration process. This study proposes a technique to assess the accuracy of the extrinsic camera parameters, as estimated during calibration. This method relies on the fact that solid objects in the real world cannot possess a gap in between, nor interpenetrate, when in contact with each other. In our study, we used motion capture to track successive collisions of two solid moving objects. The motion of solid objects was simulated based on trajectories measured by a multi-camera system and geometric information acquired from computed tomography. The simulations were then used to determine the amount of overlap or gap between them. This technique also takes into account errors resulting from markers moving close to one another, and better replicates actual movements during motion capture. We propose that this technique of successively colliding two solid moving objects may provide a means of measuring calibration accuracy.</p>
</abstract>
<kwd-group>
<kwd>error analysis</kwd>
<kwd>collision detection</kwd>
<kwd>camera calibration</kwd>
<kwd>accuracy</kwd>
<kwd>kinematics</kwd>
</kwd-group>
<contract-num rid="cn002">306757</contract-num>
<contract-sponsor id="cn001">European Research Council (ERC)</contract-sponsor>
<contract-sponsor id="cn002">European Union&#x02019;s Seventh Framework Program (FP7/2007-2013)</contract-sponsor>
<contract-sponsor id="cn003">Fraunhofer Attract Program</contract-sponsor>
<counts>
<fig-count count="8"/>
<table-count count="2"/>
<equation-count count="2"/>
<ref-count count="14"/>
<page-count count="6"/>
<word-count count="3625"/>
</counts>
</article-meta>
</front>
<body>
<sec id="S1" sec-type="introduction">
<title>Introduction</title>
<p>Motion capture has a wide range of applications, including virtual reality, sports, and medicine. Each of these applications requires high accuracy so as to distinguish the pathological deficits from the normal, or to simulate an authentic virtual environment to train surgeons. One of the methods in motion capture is using automated three-dimensional (3D) reconstruction of moving skin markers to determine joint kinematics. A critical factor, however, affecting the accuracy of the kinematic data is the quality of the camera calibration. Errors may affect the accurate determination of joint centres, which, in turn, will affect the calculation of moments and powers at a joint.</p>
<p>The theory and process of deriving 3D positions of retro-reflective markers from several two-dimensional (2D) camera projections have been extensively studied (Brown, <xref ref-type="bibr" rid="B2">1971</xref>; Tsai, <xref ref-type="bibr" rid="B13">1987</xref>; Chen et al., <xref ref-type="bibr" rid="B3">1994</xref>; Zhang, <xref ref-type="bibr" rid="B14">2000</xref>). This was traditionally achieved by utilizing 2D reference points, whose 3D coordinates were then determined in a defined coordinate system. Calibration success, however, is dependent on factors such as calibration procedure, camera setup, and volume effects. A good calibration, therefore, is frequently achieved in a controlled environment such as in a gait laboratory.</p>
<p>In principle, accurate 3D measurements could be achieved in a conventional setting. Previous studies have reported good accuracy when commercially available camera systems were used in a predefined volume (Dorociak and Cuddeford, <xref ref-type="bibr" rid="B5">1995</xref>; Ehara et al., <xref ref-type="bibr" rid="B7">1997</xref>; Richards, <xref ref-type="bibr" rid="B10">1999</xref>; Papic et al., <xref ref-type="bibr" rid="B9">2004</xref>). Liu et al. (<xref ref-type="bibr" rid="B8">2007</xref>) further explored the accuracy of an optical system in the 0.5&#x02013;200&#x02009;&#x003BC;m range for really small tooth displacements. In these studies, deviations from known distances or angles between fixed markers were determined. This, however, does not reflect what happens in reality. One only has to consider the complicated movements of a multi-segmented human body to know that distances between markers are not constantly fixed while in motion.</p>
<p>The difficulty of calibrating multiple cameras simultaneously is increased if it was performed outdoors or in wide open spaces, where moving markers are too small for a clear and concise reconstruction and calculation of the calibration parameters (Barreto and Daniilidis, <xref ref-type="bibr" rid="B1">2004</xref>). The process of being able to capture movements accurately thus continues to evolve.</p>
<p>In this study, we have developed a procedure for quantifying calibration accuracy of a multi-camera system based on collision detection by using markers that move relatively to each other during the calibration procedure. Saini et al. (<xref ref-type="bibr" rid="B11">2009</xref>) had previously utilized this procedure. In their study, the geometry of a mandibular (lower) tooth obtained from computed tomography (CT) and motion capture data of natural chewing movements were used to automatically reconstruct the tooth&#x02019;s maxillary (upper) counterpart. We have, likewise, used motion capture to track successive collisions of two solid moving objects.</p>
<p>We simulated the recorded movement in Python programing language (<uri xlink:href="https://www.python.org/">https://www.python.org/</uri>), and represented the virtual objects in a voxel-based manner. One of the virtual objects served as reference, and the other as test object. In order to account for the fact that solid objects cannot penetrate each other, we progressively eroded the virtual test object by removing voxels every time it collided with the reference object. The volumetric difference between the initial and the final virtual object reflected the degree to which the simulation deviated from reality. This process, therefore, provided a measure of calibration accuracy.</p>
</sec>
<sec id="S2">
<title>Camera Calibration</title>
<p>Camera calibration is the process of reconstructing the transformation from points in a world coordinate system to their corresponding points in an image plane. The transformation can be represented by a 3&#x02009;&#x000D7;&#x02009;4 matrix T<sub>W2I</sub>, which is composed of intrinsic and extrinsic camera parameters. Intrinsic camera parameters (focal length, image center, aspect ratio, and distortion of the lens) characterize the camera&#x02019;s projection properties. Extrinsic camera parameters specify the orientation and position of the camera in the world coordinate system.</p>
<p>The transformation process is described by the following equation in homogeneous coordinates P<sub>I</sub>&#x02009;&#x0003D;&#x02009;(x, y, 1) for image points and P<sub>W</sub>&#x02009;&#x0003D;&#x02009;(X, Y, Z, 1) for real-world points:
<disp-formula id="E1"><mml:math id="M1"><mml:mtable columnalign="left" class="align-star"><mml:mtr><mml:mtd columnalign="right" class="align-odd"></mml:mtd><mml:mtd class="align-even"><mml:msub><mml:mrow><mml:mi>P</mml:mi></mml:mrow><mml:mrow><mml:mtext>I</mml:mtext></mml:mrow></mml:msub><mml:mspace width="0.3em" class="nbsp"/><mml:mo class="MathClass-rel">&#x0003D;</mml:mo><mml:msub><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mtext>W2I</mml:mtext></mml:mrow></mml:msub><mml:mo class="MathClass-bin">&#x02217;</mml:mo><mml:mtext></mml:mtext><mml:msub><mml:mrow><mml:mi>P</mml:mi></mml:mrow><mml:mrow><mml:mtext>W</mml:mtext></mml:mrow></mml:msub></mml:mtd><mml:mtd columnalign="right" class="align-label"></mml:mtd><mml:mtd class="align-label"><mml:mspace width="2em"/></mml:mtd></mml:mtr><mml:mtr><mml:mtd columnalign="right" class="align-odd"></mml:mtd><mml:mtd class="align-even"><mml:mtext>&#x02009;&#x02009;&#x02009;&#x02009;&#x02009;&#x02009;&#x02009;</mml:mtext><mml:mo class="MathClass-rel">&#x0003D;</mml:mo><mml:msub><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mtext>C2I</mml:mtext></mml:mrow></mml:msub><mml:mo class="MathClass-bin">&#x02217;</mml:mo><mml:mtext></mml:mtext><mml:msub><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mtext>W2C</mml:mtext></mml:mrow></mml:msub><mml:mo class="MathClass-bin">&#x02217;</mml:mo><mml:mtext></mml:mtext><mml:msub><mml:mrow><mml:mi>P</mml:mi></mml:mrow><mml:mrow><mml:mtext>W</mml:mtext></mml:mrow></mml:msub><mml:mspace width="2em"/></mml:mtd><mml:mtd columnalign="right" class="align-label"></mml:mtd><mml:mtd class="align-label"><mml:mspace width="2em"/></mml:mtd></mml:mtr></mml:mtable></mml:math>
</disp-formula></p>
<p>T<sub>W2I</sub> is the product of the 3&#x02009;&#x000D7;&#x02009;3 intrinsic T<sub>C2I</sub> and the 3&#x02009;&#x000D7;&#x02009;4 extrinsic T<sub>W2C</sub> calibration matrixes.</p>
<p>In the field of motion analysis, the process of finding the intrinsic parameters (linearization) and the extrinsic parameters (calibration) is performed separately. One popular technique for linearization is proposed by Zhang (<xref ref-type="bibr" rid="B14">2000</xref>), which takes into account projection errors by having a camera observe a planar pattern in at least two different orientations. The intrinsic parameters are then optimized for all patterns simultaneously. One technique in determining the extrinsic parameters is the wand calibration method, where a wand comprising two markers at a known fixed distance is waved in the capture volume. The 2D image coordinates attained are then used to calculate the 3D coordinates using bundle adjustment (Triggs et al., <xref ref-type="bibr" rid="B12">2000</xref>).</p>
<p>Since the cameras in our study had already been linearized, we assumed that the intrinsic calibration parameters were known, and focused only on the camera extrinsic matrix (T<sub>W2C</sub>). The extrinsic parameter matrix T<sub>W2C</sub>&#x02009;&#x0003D;&#x02009;(R&#x0007C;t), which consisted of a 3&#x02009;&#x000D7;&#x02009;3 rotation matrix R and a 3&#x02009;&#x000D7;&#x02009;1 translation vector t, described the camera position relative to the world coordinate system. Here, we evaluated the accuracy of T<sub>W2C</sub> as estimated by the motion capture system during wand calibration.</p>
</sec>
<sec id="S3" sec-type="methods">
<title>Methods</title>
<sec id="S3-1">
<title>Experiment</title>
<p>We utilized two wooden cubes with lengths of 10&#x02009;cm. A total of five reflective markers were randomly attached to three of the six faces of the cube. Each cube was mounted on a pole for better handling (Figure <xref ref-type="fig" rid="F1">1</xref>).</p>
<fig position="float" id="F1">
<label>Figure 1</label>
<caption><p><bold>The two cubes used in the experiment</bold>.</p></caption>
<graphic xlink:href="fbioe-03-00065-g001.tif"/>
</fig>
<p>Three experiments were performed, in which the two cubes were moved relative to one another. This included random collisions and the rubbing of the marker-free faces from time to time. We ensured that entire faces were rubbed against each other. Nine cameras from Qualisys (Gothenburg, Sweden) 3D motion capture system were used to record 15&#x02009;s of the marker positions at a sampling frequency of 100&#x02009;Hz.</p>
</sec>
<sec id="S3-2">
<title>Registration and voxelization of the cubes</title>
<p>A Tomoscope HV500 (Werth Messtechnik GmbH, Rudersberg, Germany) CT scan was used to capture the geometry of the cubes as a point cloud with 1.2&#x02009;mm thick images. The positions of the markers&#x02019; midpoints were determined with an image processing software tool that specializes in extracting 3D regular geometric figures and spheres from point cloud data (Effenberger et al., <xref ref-type="bibr" rid="B6">2013</xref>).</p>
<p>We computed a tight-fitting oriented bounding box around each cube&#x02019;s point cloud and divided it into a 3D grid of regularly spaced voxels. The points and marker coordinates in the point cloud were then transformed to voxel indices N. Each cube was then represented as a bounding box of voxels with resolution N&#x0002A;N&#x0002A;N. Here, we assumed that each voxel is a perfect cuboid.</p>
</sec>
<sec id="S3-3">
<title>Simulation</title>
<p>The experiments were simulated using Python software, with the markers&#x02019; trajectories as measured by Qualisys, and the geometric information determined from the CT scan (Figure <xref ref-type="fig" rid="F2">2</xref>). As the simulated scene mirrored the real dynamics of the cubes during the experiments, the voxel representations of the reconstructed cubes, referred to as C<sub>R</sub> (reference) and C<sub>T</sub> (test object), should not overlap at any time. In the simulations, however, we observed an overlap of the cubes&#x02019; surfaces at certain points in time. These overlaps occurred when they were touching or very close to each other. Given that the cube geometries had been reliably captured by the CT scan, these occurrences were ascribed to inaccurate camera calibration. The simulation might, similarly, exhibit a gap between the cubes&#x02019; faces, even though in reality the cubes were in direct contact. The two types of deviations were then categorized as Type I and Type II errors (Table <xref ref-type="table" rid="T1">1</xref>).</p>
<fig position="float" id="F2">
<label>Figure 2</label>
<caption><p><bold>Reconstructed experiment from markers&#x02019; trajectories</bold>.</p></caption>
<graphic xlink:href="fbioe-03-00065-g002.tif"/>
</fig>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p><bold>Types of errors during simulation</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left">Simulation</th>
<th align="left">Reality</th>
<th align="left">Error</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left"><inline-graphic xlink:href="fbioe-03-00065-i001.tif"/></td>
<td align="left"><inline-graphic xlink:href="fbioe-03-00065-i002.tif"/> or <inline-graphic xlink:href="fbioe-03-00065-i003.tif"/></td>
<td align="left">Type I</td>
</tr>
<tr>
<td align="left"><inline-graphic xlink:href="fbioe-03-00065-i004.tif"/></td>
<td align="left"><inline-graphic xlink:href="fbioe-03-00065-i005.tif"/></td>
<td align="left">Type II</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>In order to measure the extent of Type I errors, the number of overlapping voxels between the two cubes was summed up for the entire time period. In order to account for Type II errors, a thin layer of voxels was added onto the contacting face of C<sub>T</sub>, where collision was induced during the experiments. Since we did collide and rub entire faces during the experiments, the outer voxel padding should be fully removed by the end of the simulated trial. The number of remaining outer voxels would thus indicate the magnitude of Type II error.</p>
<p>In the following sections, we will use the terms &#x0201C;Type I voxel&#x0201D; and &#x0201C;Type II voxel&#x0201D; to refer to voxels representing a Type I or Type II error respectively. Figure <xref ref-type="fig" rid="F3">3</xref> illustrates the respective procedures for detecting Type I and Type II errors.</p>
<fig position="float" id="F3">
<label>Figure 3</label>
<caption><p><bold>Error detection</bold>. <bold>(A)</bold> Voxelizing the cube&#x02019;s interior for overlap detection (Type I). <bold>(B)</bold> Voxelizing the cube&#x02019;s exterior for detection of spurious gaps (Type II).</p></caption>
<graphic xlink:href="fbioe-03-00065-g003.tif"/>
</fig>
<p>During the simulated collisions between C<sub>T</sub> and C<sub>R</sub>, both Type I and Type II voxels were determined. A local coordinate system with the origin specified at one corner was defined for each cube. Each voxel had 3D coordinates ranging from 1 to N, with N being the voxel resolution. The local coordinates of C<sub>R</sub> were then transformed to the coordinate system of C<sub>T</sub>. If the transformed coordinates of a voxel fall within the coordinate system of C<sub>T</sub>, this is marked as a Type I voxel. In a similar manner, to determine Type II voxels, 3D coordinates ranging from N&#x02009;&#x0002B;&#x02009;1 to N&#x02009;&#x0002B;&#x02009;d were defined for the padding, with d being the number of padding layers. If a voxel in the padding has coordinates that fall within the range of the coordinate system of C<sub>T</sub>, it is removed and unmarked as a Type II voxel.</p>
<p>This problem of collision detection could also be viewed as a matter of shape optimization (Saini et al., <xref ref-type="bibr" rid="B11">2009</xref>). Figures <xref ref-type="fig" rid="F4">4</xref>A&#x02013;C show the simplified 2D view of the adjusted shape of C<sub>T</sub> at time t<sub>0</sub>, t<sub>i</sub> and t<sub>j</sub> with 0&#x02009;&#x0003C;&#x02009;i&#x02009;&#x0003C;&#x02009;j. Here, the right side of C<sub>T</sub> was in contact with C<sub>R</sub>. Voxels which were untouched during the simulation were shown in white; they defined the geometry of C<sub>T</sub> at any given time. Deleted voxels were shown in gray. If the simulation of the cubes and their movements were perfect, the initial and final C<sub>T</sub> would be identical (Figure <xref ref-type="fig" rid="F5">5</xref>).</p>
<fig position="float" id="F4">
<label>Figure 4</label>
<caption><p><bold>Adjusted shape of C<sub>T</sub> as time progresses</bold>. Untouched voxels are shown in white, deleted ones in gray. The two layers of voxels on the right are the extra padding. <bold>(A)</bold> time t<sub>0</sub> <bold>(B)</bold> time t<sub>i</sub> <bold>(C)</bold> time t<sub>j</sub></p></caption>
<graphic xlink:href="fbioe-03-00065-g004.tif"/>
</fig>
<fig position="float" id="F5">
<label>Figure 5</label>
<caption><p><bold>Final shape of C<sub>T</sub> in the case of a perfect simulation</bold>.</p></caption>
<graphic xlink:href="fbioe-03-00065-g005.tif"/>
</fig>
<p>The volumetric difference between the initial and the final shape-optimized C<sub>T</sub> could be used to quantify the degree of agreement between simulation and reality. Clearly, the level of accuracy was dependent on the voxel size. The different voxel resolutions had thus been taken into account.</p>
<p>The total volumetric difference VolDiff<sub>N</sub><sup>(t)</sup> (cm<sup>3</sup>) at time <italic>t</italic> (ms) resulting from Type I and Type II errors depended on the voxel resolution of N&#x0002A;N&#x0002A;N, and was given by
<disp-formula id="E2"><label>(1)</label><mml:math id="M2"><mml:mrow><mml:msub><mml:mrow><mml:mtext>VolDiff</mml:mtext></mml:mrow><mml:mtext>N</mml:mtext></mml:msub><mml:msup><mml:mtext>&#x02009;</mml:mtext><mml:mrow><mml:mo stretchy='false'>(</mml:mo><mml:mtext>t</mml:mtext><mml:mo stretchy='false'>)</mml:mo></mml:mrow></mml:msup><mml:mtext>&#x02009;</mml:mtext><mml:mo>=</mml:mo><mml:mo stretchy='false'>(</mml:mo><mml:msub><mml:mtext>n</mml:mtext><mml:mtext>I</mml:mtext></mml:msub><mml:msup><mml:mtext>&#x02009;</mml:mtext><mml:mrow><mml:mo stretchy='false'>(</mml:mo><mml:mtext>t</mml:mtext><mml:mo stretchy='false'>)</mml:mo></mml:mrow></mml:msup><mml:mtext>&#x02009;</mml:mtext><mml:mo>+</mml:mo><mml:mtext>&#x02009;</mml:mtext><mml:msub><mml:mtext>n</mml:mtext><mml:mrow><mml:mtext>II</mml:mtext></mml:mrow></mml:msub><mml:msup><mml:mtext>&#x02009;</mml:mtext><mml:mrow><mml:mo stretchy='false'>(</mml:mo><mml:mtext>t</mml:mtext><mml:mo stretchy='false'>)</mml:mo></mml:mrow></mml:msup><mml:mo stretchy='false'>)</mml:mo><mml:mo>*</mml:mo><mml:msub><mml:mtext>v</mml:mtext><mml:mtext>N</mml:mtext></mml:msub></mml:mrow></mml:math></disp-formula>
where n<sub>I</sub><sup>(t)</sup> and n<sub>II</sub><sup>(t)</sup> are the number of Type I and Type II voxels at time t, and v<sub>N</sub> is the volume of a single voxel.</p>
</sec>
</sec>
<sec id="S4" sec-type="discussion">
<title>Results and Discussion</title>
<p>Based on the three experiments, a mean volumetric difference of 19.12&#x02009;&#x000B1;&#x02009;5.36&#x02009;cm<sup>3</sup> was found (Table <xref ref-type="table" rid="T2">2</xref>). This amounts to approximately 1.9% of the volume of C<sub>T</sub>. While this percentage might seem small, even tiny changes to camera calibration could affect the 3D reconstruction of markers in space (Datta et al., <xref ref-type="bibr" rid="B4">2009</xref>).</p>
<table-wrap position="float" id="T2">
<label>Table 2</label>
<caption><p><bold>Type I and II errors for all experiments at a voxel resolution of N&#x02009;&#x0003D;&#x02009;128</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left"/>
<th align="center" colspan="3">Experiment<hr/></th>
<th align="center">Mean</th>
<th align="center">SD</th>
</tr>
<tr>
<th align="left"/>
<th align="center">1</th>
<th align="center">2</th>
<th align="center">3</th>
<th align="center"/>
<th align="center"/>
</tr>
</thead>
<tbody>
<tr>
<td align="left">Type I</td>
<td align="center">22.13</td>
<td align="center">26.40</td>
<td align="center">13.83</td>
<td align="center">19.12</td>
<td align="center">5.36</td>
</tr>
<tr>
<td align="left">Type II</td>
<td align="center">0.00</td>
<td align="center">0.00</td>
<td align="center">0.00</td>
<td align="center">0.00</td>
<td align="center">0.00</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>Figure <xref ref-type="fig" rid="F6">6</xref> shows a plot of VolDiff<sub>N</sub><sup>(t)</sup> (cm<sup>3</sup>) over time (s) at varying voxel resolutions as observed in Experiment 1. At a voxel resolution of N&#x02009;&#x0003D;&#x02009;8, the total volumetric difference decreased to 0&#x02009;cm<sup>3</sup>. As mentioned, the intersection of Voxel centres between C<sub>R</sub> and C<sub>T</sub> was required for the removal of a voxel. For very coarse voxel resolutions (as in the case when N&#x02009;&#x0003D;&#x02009;8), voxels were only removed in the case of a big discrepancy between simulation and reality. After subsequent increases in voxel resolution, we observed that the total volumetric difference converged to 22.13&#x02009;cm<sup>3</sup>.</p>
<fig position="float" id="F6">
<label>Figure 6</label>
<caption><p><bold>Total volumetric difference (cm<sup>3</sup>) between initial and final shape-optimized C<sub>T</sub> over time (s) for different voxel resolutions in Experiment 1</bold>.</p></caption>
<graphic xlink:href="fbioe-03-00065-g006.tif"/>
</fig>
<p>At the beginning of the experiment (0&#x02013;2&#x02009;s), the sharp drop was a result of the removal of Type II voxels (Figures <xref ref-type="fig" rid="F3">3</xref>B and <xref ref-type="fig" rid="F7">7</xref>) when both cubes were in contact with each other. It would seem that no gaps were found between the cubes in all the experiments, and Type II voxels were not as problematic as had been anticipated. As all the outer voxels had been removed, subsequent increases in the combined volumetric error were due to Type I voxels (Figure <xref ref-type="fig" rid="F8">8</xref>) being removed at instances when both cubes were rubbing against each other.</p>
<fig position="float" id="F7">
<label>Figure 7</label>
<caption><p><bold>Type II volumetric errors (cm<sup>3</sup>) over time (s) for different voxel resolutions in Experiment 1</bold>.</p></caption>
<graphic xlink:href="fbioe-03-00065-g007.tif"/>
</fig>
<fig position="float" id="F8">
<label>Figure 8</label>
<caption><p><bold>Type I volumetric errors (cm<sup>3</sup>) over time (s) for different voxel resolutions in Experiment 1</bold>.</p></caption>
<graphic xlink:href="fbioe-03-00065-g008.tif"/>
</fig>
<p>For this technique to work, therefore, an appropriate voxel resolution had to be chosen carefully so as to determine the volumetric difference between the initial and final-shaped C<sub>T</sub>. In our study, a voxel resolution of N&#x02009;&#x0003E;&#x02009;64 demonstrated a convergence to the total volumetric difference.</p>
<p>In motion capture systems, the determination of the intrinsic and extrinsic parameters is performed separately. In our study, we evaluated the accuracy of the extrinsic parameter matrix during wand calibration, assuming that the intrinsic parameters were already known. Our proposed collision detection technique could also be utilized when intrinsic parameters are unknown or to be determined simultaneously with the extrinsic parameters. Larger errors, however, will be expected. This is because while it is simpler to use the wand calibration method to simultaneously determine both intrinsic and extrinsic parameters, all the parameters will ultimately become sensitive to the movement of the calibration wand<xref ref-type="fn" rid="fn1"><sup>1</sup></xref>. This may, therefore, result in larger calibration errors.</p>
<p>It is clear that there may be other sources of errors affecting the extrinsic calibration parameters, such as camera placement. In our study, average residuals of 0.46&#x02013;0.75&#x02009;mm were recorded during calibration, which were within the accepted range of 0.5&#x02013;1.5&#x02009;mm as recommended by Qualisys. It is, therefore, apparent that while intrinsic parameters such as lens distortion cannot be completely eradicated, random errors could still be determined and minimized. In a controlled environment such as in the laboratory, high quality calibration could easily be achieved. In measurements performed outdoors and/or where large capture volumes are required, it becomes difficult to consider all the 2D image coordinates simultaneously so as to attain the 3D coordinates required to determine the extrinsic camera parameters (Barreto and Daniilidis, <xref ref-type="bibr" rid="B1">2004</xref>). We are not suggesting that this technique should replace the commonly used calibration procedures (recommended by Qualisys and/or other motion capture systems), but our approach can instead complement these procedures to minimize calibration errors.</p>
<p>This technique could be an additional criterion to the wand calibration method, since this collision detection takes into account the errors that resulted from markers moving close to one another. The wand calibration method includes waving a wand with two markers at a fixed distance in the capture volume. Previous studies performing accuracy tests of several motion capture systems also utilized markers at fixed distances to each other (Dorociak and Cuddeford, <xref ref-type="bibr" rid="B5">1995</xref>; Ehara et al., <xref ref-type="bibr" rid="B7">1997</xref>; Richards, <xref ref-type="bibr" rid="B10">1999</xref>; Papic et al., <xref ref-type="bibr" rid="B9">2004</xref>). Compared to our technique, the use of fixed markers in the previous studies was straightforward and easier to implement. Distances between moving markers, however, are not constantly fixed during capture. The human body, consisting of multi-linkages, is fully capable of performing complicated 3D movements. Our technique, therefore, better replicates actual movements during motion capture.</p>
<p>In conclusion, we have proposed a procedure based on collision detection, which could be used as an indicator of calibration accuracy. This technique can complement current calibration methods to minimize calibration errors when simultaneous calibration of multiple cameras is required.</p>
</sec>
<sec id="S5">
<title>Conflict of Interest Statement</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
</body>
<back>
<ack>
<p>The authors would like to acknowledge the European Research Council (ERC) under the European Union&#x02019;s Seventh Framework Program (FP7/2007-2013) ERC grant agreement n. 306757 and the Fraunhofer Attract Program for financial support of this research.</p>
</ack>
<ref-list>
<title>References</title>
<ref id="B1"><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Barreto</surname> <given-names>J. P.</given-names></name> <name><surname>Daniilidis</surname> <given-names>K.</given-names></name></person-group> (<year>2004</year>). &#x0201C;<article-title>Wide area multiple camera calibration and estimation of radial distortion</article-title>,&#x0201D; in <conf-name>Proceedings of the fifth Workshop on Omnidirectional Vision, Camera Networks and Non-Classical Cameras</conf-name>, <conf-loc>Prague</conf-loc>.</citation></ref>
<ref id="B2"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Brown</surname> <given-names>D. C.</given-names></name></person-group> (<year>1971</year>). <article-title>Close-range camera calibration</article-title>. <source>Photogramm. Eng.</source> <volume>37</volume>, <fpage>855</fpage>&#x02013;<lpage>866</lpage>.</citation></ref>
<ref id="B3"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>L.</given-names></name> <name><surname>Armstrong</surname> <given-names>C. W.</given-names></name> <name><surname>Raftopoulos</surname> <given-names>D. D.</given-names></name></person-group> (<year>1994</year>). <article-title>An investigation on the accuracy of three-dimensional space reconstruction using the direct linear transformation technique</article-title>. <source>J. Biomech.</source> <volume>27</volume>, <fpage>493</fpage>&#x02013;<lpage>500</lpage>.<pub-id pub-id-type="doi">10.1016/0021-9290(94)90024-8</pub-id><pub-id pub-id-type="pmid">8188729</pub-id></citation></ref>
<ref id="B4"><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Datta</surname> <given-names>A.</given-names></name> <name><surname>Kim</surname> <given-names>J.</given-names></name> <name><surname>Kanade</surname> <given-names>T.</given-names></name></person-group> (<year>2009</year>). &#x0201C;<article-title>Accurate camera calibration using iterative refinement of control points</article-title>,&#x0201D; in <conf-name>Proceedings of IEEE 12th International Conference on Computer Vision, Workshop on Visual Surveillance</conf-name>, <conf-loc>Kyoto</conf-loc>.</citation></ref>
<ref id="B5"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dorociak</surname> <given-names>R. D.</given-names></name> <name><surname>Cuddeford</surname> <given-names>T. J.</given-names></name></person-group> (<year>1995</year>). <article-title>Determining 3-D system accuracy for the VICON 370 System</article-title>. <source>Gait Posture</source> <volume>3</volume>, <fpage>88</fpage>.<pub-id pub-id-type="doi">10.1016/0966-6362(95)93468-R</pub-id></citation></ref>
<ref id="B6"><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Effenberger</surname> <given-names>I.</given-names></name> <name><surname>K&#x000FC;hnle</surname> <given-names>J.</given-names></name> <name><surname>Verl</surname> <given-names>A.</given-names></name></person-group> (<year>2013</year>). &#x0201C;<article-title>Fast and flexible 3D object recognition solutions for machine vision applications</article-title>,&#x0201D; in <conf-name>Proceedings of Society of Photo-Optical Instrumentation Engineers (SPIE) 8661, Image Processing: Machine Vision Applications VI</conf-name>, <conf-loc>Burlingame, CA</conf-loc>.</citation></ref>
<ref id="B7"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ehara</surname> <given-names>Y.</given-names></name> <name><surname>Fujimoto</surname> <given-names>H.</given-names></name> <name><surname>Miyazaki</surname> <given-names>S.</given-names></name> <name><surname>Mochimaru</surname> <given-names>M.</given-names></name> <name><surname>Tanaka</surname> <given-names>S.</given-names></name> <name><surname>Yamamoto</surname> <given-names>S.</given-names></name></person-group> (<year>1997</year>). <article-title>Comparison of the performance of 3D camera systems II</article-title>. <source>Gait Posture</source> <volume>5</volume>, <fpage>251</fpage>&#x02013;<lpage>255</lpage>.<pub-id pub-id-type="doi">10.1016/S0966-6362(96)01093-4</pub-id><pub-id pub-id-type="pmid">21640352</pub-id></citation></ref>
<ref id="B8"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>H.</given-names></name> <name><surname>Holt</surname> <given-names>C.</given-names></name> <name><surname>Evans</surname> <given-names>S.</given-names></name></person-group> (<year>2007</year>). <article-title>Accuracy and repeatability of an optical motion analysis system for measuring small deformations of biological tissues</article-title>. <source>J. Biomech.</source> <volume>40</volume>, <fpage>210</fpage>&#x02013;<lpage>214</lpage>.<pub-id pub-id-type="doi">10.1016/j.jbiomech.2005.11.007</pub-id><pub-id pub-id-type="pmid">16376351</pub-id></citation></ref>
<ref id="B9"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Papic</surname> <given-names>V.</given-names></name> <name><surname>Zanchi</surname> <given-names>V.</given-names></name> <name><surname>Cecic</surname> <given-names>M.</given-names></name></person-group> (<year>2004</year>). <article-title>Motion analysis system for identification of 3D human locomotion kinematics data and accuracy testing</article-title>. <source>Simul. Model. Pract. Theory</source> <volume>12</volume>, <fpage>159</fpage>&#x02013;<lpage>170</lpage>.<pub-id pub-id-type="doi">10.1016/j.simpat.2003.08.008</pub-id></citation></ref>
<ref id="B10"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Richards</surname> <given-names>J. G.</given-names></name></person-group> (<year>1999</year>). <article-title>The measurement of human motion: a comparison of commercially available systems</article-title>. <source>Hum. Mov. Sci.</source> <volume>18</volume>, <fpage>589</fpage>&#x02013;<lpage>602</lpage>.<pub-id pub-id-type="doi">10.1016/S0167-9457(99)00023-8</pub-id></citation></ref>
<ref id="B11"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Saini</surname> <given-names>H.</given-names></name> <name><surname>Waddell</surname> <given-names>J. N.</given-names></name> <name><surname>Pullan</surname> <given-names>A. J.</given-names></name> <name><surname>Roehrle</surname> <given-names>O.</given-names></name></person-group> (<year>2009</year>). <article-title>Automatically generating subject-specific functional tooth surfaces using virtual mastication</article-title>. <source>Ann. Biomed. Eng.</source> <volume>37</volume>, <fpage>1646</fpage>&#x02013;<lpage>1653</lpage>.<pub-id pub-id-type="doi">10.1007/s10439-009-9725-y</pub-id><pub-id pub-id-type="pmid">19484386</pub-id></citation></ref>
<ref id="B12"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Triggs</surname> <given-names>B.</given-names></name> <name><surname>McLauchlan</surname> <given-names>P.</given-names></name> <name><surname>Hartley</surname> <given-names>R.</given-names></name> <name><surname>Fitzgibbon</surname> <given-names>A.</given-names></name></person-group> (<year>2000</year>). <article-title>Bundle adjustment &#x02013; a modern synthesis</article-title>. <source>Lect. Notes Comput. Sci.</source> <volume>1883</volume>, <fpage>298</fpage>&#x02013;<lpage>372</lpage>.<pub-id pub-id-type="doi">10.1007/3-540-44480-7_21</pub-id></citation></ref>
<ref id="B13"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tsai</surname> <given-names>R. Y.</given-names></name></person-group> (<year>1987</year>). <article-title>A versatile camera calibration technique for high-accuracy 3D machine vision metrology using off-the-shelf TV cameras and lens</article-title>. <source>IEEE J. Rob. Autom.</source> <volume>3</volume>, <fpage>323</fpage>&#x02013;<lpage>344</lpage>.<pub-id pub-id-type="doi">10.1109/JRA.1987.1087109</pub-id></citation></ref>
<ref id="B14"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>Z.</given-names></name></person-group> (<year>2000</year>). <article-title>A flexible new technique for camera calibration</article-title>. <source>IEEE Trans. Pattern Anal. Mach. Intell.</source> <volume>22</volume>, <fpage>1330</fpage>&#x02013;<lpage>1334</lpage>.<pub-id pub-id-type="doi">10.1109/34.888718</pub-id></citation></ref>
</ref-list>
<fn-group>
<fn id="fn1"><p><sup>1</sup>Hofverberg, S. (2007). Theories for 3D reconstruction. Qualisys Technical Note.</p></fn>
</fn-group>
</back>
</article>