Mechanical testing of rodent bones is a valuable method for extracting information regarding a bone’s susceptibility to fracture. Lacking proper practical understanding, the results may be overinterpreted or lack validity. This protocol will serve as a guide to ensure mechanical tests are performed accurately to provide valid and functional data.
Skeletal fragility leading to fracture is an American public health crisis resulting in 1.5 million fractures each year and $18 billion in direct care costs. The ability to understand the mechanisms underlying bone disease and the response to treatment is not only desired, but critical. Mechanical testing of bone serves as a valuable technique for understanding and quantifying a bone's susceptibility to fracture. While this method appears simple to perform, inappropriate and inaccurate conclusions may be reached if governing assumptions and key steps are disregarded by the user. This has been observed across disciplines as studies continue to be published with misuse of methods and incorrect interpretation of results. This protocol will serve as a primer for the principles associated with mechanical testing along with the application of these techniques-from considerations of sample size through tissue harvesting and storage, to data analysis and interpretation. With this in hand, valuable information regarding a bone's susceptibility to fracture may be obtained, furthering understanding for both academic research and clinical solutions.
Mechanical testing of bone is the primary method to extract functional information related to a bone's susceptibility to fracture. In preclinical studies, several testing modalities can be used but by far the most common is the bending of long bones. These tests are easy to perform and can be used on bones ranging in size from human to mouse. As mice are one of the most commonly studied animals in preclinical research, this protocol will focus on bending tests performed on the femora and tibiae of mice.
Prior to performing bending tests, bones must be properly harvested and stored. The most common storage methods have traditionally been freezing bones in saline-soaked gauze, freezing in saline alone, or dehydrating bones in ethanol 1. Bones stored in ethanol have been shown to have increased stiffness and elastic modulus and decreased deformation parameters versus those stored frozen1. Even rehydrating the bones prior to testing does not recover these properties back to normal levels 1. Storing submerged in saline could cause damage to the bone since pressure is exerted as the saline expands. In addition, a complete thaw of the solution would be required to remove the bones for microcomputed tomography (µCT) scanning. Consequently, freezing freshly harvested bones in saline-soaked gauze has become the standard storage method and is recommended throughout this protocol.
Because the size and shape of a bone affect its bulk strength and many disease models significantly alter bone size and morphology, engineering principles are used to normalize away the effects of size to produce properties that estimate the behavior of the tissue2. This approach requires cross-sectional geometry of the failure location, which is most commonly acquired using µCT to create scans of the bones prior to testing. µCT is widely used due to its availability and high image resolution. Moreover, contributions of soft tissue are not included, and scanning does not require chemical fixation or other modifications to the bone3,4. In all forms of CT, an X-ray source is focused on an object while a detector on the other side of the object measures the resulting X-ray energy. This produces an X-ray shadow of the sample that can be converted into an image3,5. The object being scanned is rotated (or the X-ray source and detector are rotated around the sample), generating images that can be reconstructed into a three-dimensional data set representing the object5.
Scan resolution, or how close together two objects can be and still be resolved individually, is controlled by changing the nominal voxel size or the size of a pixel in the resulting image. It is generally accepted that objects must be at least two times the size of a single voxel to be identified3, but a higher ratio will allow for improved precision. Further, larger voxels are more prone to partial volume effects: when a single voxel contains tissues of varying densities, it is assigned the average of these densities, rather than the specific density of a single tissue, which may lead to an over- or under-estimation of tissue areas and mineral density3. While these issues can be mitigated by choosing smaller voxel sizes, using a higher resolution does not ensure the elimination of partial volume effects and may require longer scan times3. When scanning bones ex vivo, a voxel size of 6-10 µm is generally recommended to accurately assess the trabecular architecture of mouse bones. A larger voxel size of 10-17 µm can be used for cortical bone, although the smallest reasonable voxel size should be used. This protocol uses a 10 µm voxel size, which is small enough to differentiate key trabecular properties and minimize partial volume effects without extensive scan time.
X-ray energy and energy filter settings must also be selected carefully, as the high mineral density and thickness of bone tissue greatly attenuates and alters the transmitted X-ray energy spectrum. It is generally assumed that because the emitted X-ray spectrum is equivalent to the spectrum that exits the object6, using low-energy X-rays on dense objects such as bone can lead to an artifact known as beam hardening7. A higher voltage of 50-70 kVp is recommended when scanning bone samples to reduce the incidence of these artifacts5. In addition, inserting an aluminum or copper energy filter creates a more concentrated energy beam, further minimizing artifacts4,7. A 0.5 mm aluminum filter will be used throughout this protocol.
Finally, the scan rotation step and rotation length (e.g., 180°-360°), together control the number of images captured, which determines the amount of noise in the final scan4. Averaging multiple frames in each step can reduce noise but may increase scan time4. This protocol uses a rotation step of 0.7 degrees and a frame averaging of 2.
One final note about scanning: hydroxyapatite calibration phantoms should be scanned using the same scan settings as the experimental bones to enable the conversion of attenuation coefficients to mineral density in g/cm35. This protocol uses phantoms of 0.25 g/cm3 and 0.75 g/cm3 of hydroxyapatite, although different phantoms are available. Note that some scanning systems use internal phantoms as part of daily system calibration.
Once scanning is complete, the angular projections are reconstructed into cross-sectional images of the object, typically using the manufacturer's accompanying software. Whatever system is used, it is important to ensure that the whole bone is captured in the reconstruction and that thresholding is set appropriately to allow for the recognition of bone versus non-bone. After reconstruction, it is critical to rotate all scans in three dimensions so that bones are oriented consistently and properly aligned with the transverse axis, again using the manufacturer's software.
Following rotation, regions of interest (ROI) for analysis may be selected based on whether cortical properties, trabecular properties, or fracture geometry for mechanical normalization are desired. For the latter, ROIs should be selected after testing by measuring the distance from the fracture site to one end of the bone and using voxel size to determine the corresponding slice location in the scan file. The selected region should be at least 100 µm in length, with the fracture point at the approximate center of the ROI, to provide adequate estimation4.
With ROIs selected, two properties are needed for mechanical normalization (to calculate bending stress and strain): the maximum distance from the neutral bending axis to the surface where failure is initiated (assumed to be the surface loaded in tension, determined by the testing setup), and the area moment of inertia around the neutral axis, (also dependent on testing setup). This protocol recommends the use of a custom code to determine these values. For access to the code, contact the corresponding author directly or visit the lab website at https://bbml.et.iupui.edu/ for more information.
Once µCT scanning has been completed, mechanical testing can begin. Bending tests can be performed in either four-point or three-point configurations. Four-point bending tests are preferred as they eliminate shear stress in the bone between loading points, allowing for pure bending to occur in this region3. The bone will then fracture due to tension, creating a failure that is more representative of the true bending properties of the bone3. However, the bone must be loaded in such a way as to deliver the same load at both loading points (this can be facilitated with a pivoting loading head). In three-point bending tests, there is a large change in shear stress where the load point meets the bone, which causes the bone to break at this point due to shear, not tension3. ASTM standards recommend materials undergoing bending should have a length-to-width ratio of 16:1, meaning the length of the support span should be 16 times larger than the width of the bone to minimize impacts of shear8,9. This is often impossible to achieve when testing small rodent bones, so the loading span is simply made as large as possible but with as small of a change in cross-sectional shape as possible. Moreover, when performing four-point bending, the ratio between the lengths of the lower and upper span should be ~3:18, which can usually be achieved in the tibia, but it is difficult in the shorter femur. In addition, the thinner cortical walls of femurs make them susceptible to ring-type deformation which changes the shape of the bone cross-section during the test (this can be accentuated in four-point tests as a greater force is required to induce the same bending moment compared to three-point bending). Therefore, three-point bending will be utilized for mouse femora while four-point bending will be used for tibiae throughout this protocol.
Finally, it is important to properly power the study for statistical analysis. A general recommendation for mechanical testing is to have a sample size of 10-12 bones per experimental group to be able to detect differences, as some mechanical properties, especially postyield parameters, can be highly variable. In some cases, this may mean starting with a higher animal sample size given attrition that could occur during the study. Sample size analysis using existing data should be completed prior to attempting a study.
There are numerous limitations and assumptions, but bending tests can provide quite accurate results, especially when relative differences between groups are of interest. These properties, together with the analysis of trabecular architecture and cortical morphology, can provide better insight into disease states and treatment regimens. If care is taken with those aspects of the experiment that are in our control (e.g., harvesting, storing, scanning, and testing), we can feel confident that accurate results have been generated.
All procedures described throughout this protocol that involved animals have been approved by the Indiana University School of Science Institutional Animal and Use Committee (IACUC) prior to the procedure. Animals described in the procedure were euthanized via CO2inhalation followed by cervical dislocation as a secondary means of euthanasia.
1. Harvest, storage, and thawing of bones
2. µCT scanning
3. µCT reconstruction
4. µCT rotation
NOTE: Once reconstructed, scans must be rotated to establish consistent orientation across all bones, and to ensure that transverse sections of the resulting bone are taken normal to the longitudinal axis with as little offset angle as possible. This should be done with the user's software of choice.
5. Mechanical testing procedure
Figure 1: Mechanical tester setup. (A) Three-point and (B) four-point bending tests. The load cell is shown in yellow, the loading fixtures are shown in blue, and the support fixtures are shown in green. Please click here to view a larger version of this figure.
Figure 2: Orientation of bones between fixtures. (A) Proper orientation of a femur in three-point bending loading fixtures showing (from top to bottom) views from the medial, anterior, and posterior side of the femur when correctly positioned. The loading fixtures are shown in orange and the support fixtures are shown in blue. The bottom spans should be adjusted to include as much of the straightest part of the diaphysis as possible, and the top fixture should be centered between those spans. (B) Proper orientation of a tibia for four-point bending showing (from top to bottom) views from the anterior, lateral, and medial sides of the tibia. The bone should be loaded so that the medial surface contacts the bottom fixture, and the lateral surface contacts the top fixture. The tibia-fibula junction should be placed just outside of the loading span. The spans should be adjusted to best meet a loading-to-support span ratio of 1:3. Please click here to view a larger version of this figure.
6. ROI selection
7. Normalization of force and displacement data
NOTE: The mechanical tester will only generate points with x and y coordinates (displacement, force). These points can be converted to stress and strain using the Euler-Bernoulli bending stress and strain equations, but these require geometric properties obtained from µCT scans. The quantification of these properties can be performed with the user's preferred software. We prefer a custom code, which gives complete control over all inputs, calculations, and outputs. As mentioned earlier, for access to the code, contact the corresponding author directly or visit the lab website at https://bbml.et.iupui.edu/ for more information. The stress and strain equations, as well as the necessary geometric properties that must be obtained from µCT scans, to calculate these are discussed below.
8. Mechanical testing properties of interest
Figure 3: Force-displacement and stress-strain curves. (A) Ideal force-displacement curve; (B) ideal stress-strain curve with the line derived from the 0.2% offset method used to calculate the yield point shown in red (note that this line has the same slope as that of the elastic region of the curve). Key properties that may be obtained from the force-displacement curve include yield force, ultimate force, displacement to yield, total displacement, and work. Tissue level properties that may be obtained from the stress-strain curve include yield stress, ultimate stress, strain to yield, total strain, resilience, and toughness. Please click here to view a larger version of this figure.
Upon completion of CT scanning, most inadequate scans can be caught in reconstruction. Often, poor scans will have a high misalignment compensation that is a clear indicator of an error during the scan. However, errors may occur in other steps and could also lead to inaccurate data. These errors can often be spotted as the individual calculated architectural properties are examined. If values fall far out of the range of others in a group, the scan, ROI, and method of calculating the properties should be re-examined.
Once bending tests are complete, the force-displacement plots from each test should be examined to identify poor tests that may need to be removed from the dataset. An example of a problematic test is shown in Figure 4. The plot in Figure 4A shows the results of a correctly performed bending test. There is a clear linear region lacking a low-slope toe-in, a yield point, an ultimate point (maximum force), a drop in force as displacement extends past the maximum force, and a failure point. The curve is smooth with no abrupt changes in loading until after the ultimate point has been reached. Consequently, properties from this test may be easily identified and trusted. The plot shown in Figure 4B shows the result of a bending test with multiple concerning features. The abrupt changes in loading and the appearance of multiple peaks in the plot are major indicators of problems with this test. While minor peaks may occur in a proper test near the ultimate force, the magnitude and number of peaks in this plot suggest the bone may have rolled during the test. Whether observed and noted during the test or when examining the tests prior to analysis, the sample data should be investigated during post-testing analysis. If the data are indeed faulty or far outside of the range of other samples in the group, it would be ideal to not include this test in the final dataset. This is one reason for properly powering the experiment with priori power calculations. It might be possible to report only certain properties from a sample (in this case, preyield properties could be acceptable), but this is not ideal and should be clearly explained when reported.
Figure 4: Force-displacement plots. (A) Ideal force-displacement plot. (B) Force-displacement plot resulting from a poor bending test. Please click here to view a larger version of this figure.
With all force-displacement plots examined and values normalized to stress-strain, the properties of interest may be identified and showcased in multiple ways. In Figure 5, resulting force-displacement and stress-strain plots for an entire study are shown. These are schematic representations whereby the force and displacement at the starting point (0,0), yield, ultimate point, and failure for each bone are determined and then, the force/stress and displacement/strain are averaged to provide an average plot for each group. The plots are not used for statistical analysis, but they can be used to showcase how overall behavior varies due to factors such as a treatment or disease state. The plots shown in Figure 5 are from a study comparing control mice to those induced with a state of type 2 diabetes and chronic kidney disease (T2D-CKD). The right tibiae of these animals were tested to failure using four-point bending and analyzed to obtain the properties discussed in protocol section 8. From Figure 5, it is clear the T2D-CKD group had reduced mechanical properties, including strength and stiffness, at both the structural and tissue levels. These mice also appear to have reduced postyield properties, an indicator of brittleness. These plots should not be used to draw definitive conclusions from a study. Rather, they act as a visual representation and should be verified by performing a statistical analysis on all properties of interest.
Figure 5: Force-displacement and stress-strain plots for an entire study. (A) Force-displacement plot for control animals and type 2 diabetes- and chronic kidney disease-induced animals. This plot resulted by averaging the yield force, displacement to yield, ultimate force, ultimate displacement, failure force, and total displacement for each group and plotting these means along with standard deviation. (B) Stress-strain for control animals and T2D-CKD animals. This plot resulted from averaging yield stress, strain to yield, ultimate stress, ultimate strain, failure stress, and total strain and plotting the resulting means along with standard deviation. Abbreviation: T2D-CKD = type 2 diabetes- and chronic kidney disease-induced animals. Please click here to view a larger version of this figure.
The mechanical data and results from a two-tailed t-test are shown in Table I. Data are presented as mean ± standard deviation. A general examination suggests data within proper ranges and with expected levels of variability. Note that postyield properties tend to have the greatest variation and therefore, often require the largest sample sizes to detect meaningful differences. As the schematic curves in Figure 5 suggested, there are significant decreases in almost all structural and tissue-level mechanical properties. From these data, it can be concluded that the induced disease state led to bones that are weaker, less stiff, and more susceptible to fracture due to loss of deformability and reduced toughness. Studies with more nuanced comparisons may not be as simple to interpret. An example of this may be if significant improvements are observed across structural-level mechanical properties but not tissue-level mechanical properties. In this case, the effects observed are likely driven by changes in the architecture of the bone (e.g., increased area, increased cortical thickness) rather than improvements in bone quality at the tissue level. For example, bone area increased due to the gain of woven bone, but tissue quality has decreased as an unorganized woven bone is now present rather than an organized lamellar bone. This could be supported by µCT analysis in which statistically significant improvements in architecture may be observed. Contrastingly, there may be significant improvements in tissue-level mechanical properties with minimal/no improvements in structural-level mechanical properties. This high tissue quality could mask the pitfalls of smaller bones. Data interpretation may become further convoluted if changes are observed in preyield properties but not in postyield properties or vice versa. In the former case, a change in the bone's ability to resist deformation may be improved while its ability to tolerate damage is not. In each of these cases, the ability to reference architectural properties from µCT analysis is highly beneficial and should be utilized (although describing that method is beyond the scope of this paper). Due to the complexity of the interpretation of these properties, presenting all properties in table or figure form (not just those properties that tend to be the easiest to interpret such as ultimate force, or that tell the story one is looking to tell) allows for a more complete representation of the mechanical impacts.
Control | T2D-CKD | P Value | |||
Yield Force (N) | 19.7 ± 2.9 | 15.2 ± 2.6 | 0.0032** | ||
Ultimate Force (N) | 22.8 ± 3 | 17.6 ± 3.4 | 0.0031** | ||
Displacement to Yield (µm) | 205 ± 17 | 190 ± 21 | 0.1039 | ||
Postyield Displacement (µm) | 246 ± 235 | 60 ± 51 | 0.0435* | ||
Total Displacment (µm) | 451 ± 230 | 249 ± 53 | 0.0278* | ||
Stiffness (N/mm) | 110 ± 10 | 91 ± 13 | 0.0037** | ||
Work to Yield (mJ) | 2.16 ± 0.45 | 1.54 ± 0.36 | 0.0055** | ||
Postyield Work (mJ) | 4.24 ± 3.01 | 1.04 ± 0.9 | 0.0109* | ||
Total Work (mJ) | 6.4 ± 2.88 | 2.58 ± 0.97 | 0.0025** | ||
Yield Stress (MPa) | 180 ± 20 | 157 ± 25 | 0.0504 | ||
Ultimate Stress (MPa) | 209 ± 26 | 181 ± 27 | 0.0434* | ||
Strain to Yield (mɛ) | 16.8 ± 2 | 16.4 ± 1.5 | 0.5771 | ||
Total Strain (mɛ) | 36.6 ± 17.2 | 21.5 ± 4.3 | 0.0277* | ||
Modulus (GPa) | 12.2 ± 1.1 | 10.9 ± 1.1 | 0.0171* | ||
Resilience (MPa) | 1.62 ± 0.33 | 1.38 ± 0.33 | 0.1377 | ||
Toughness (MPa) | 4.85 ± 2.29 | 2.26 ± 0.73 | 0.0076** |
Table 1: Results of mechanical testing and statistical analysis. Values shown as mean ± standard deviation. P values result from a two-tailed unpaired t-test. * P < 0.05 and ** P < 0.01. Abbreviation: T2D-CKD = type 2 diabetes- and chronic kidney disease-induced animals.
Throughout the scanning and testing process, there are moments when troubleshooting and optimization are appropriate. The first of these occurs when scanning bones using µCT. While many systems come with a holder in which one object may be held and scanned, custom holders can be fabricated to scan multiple bones at the same time. Scanning multiple bones can be an excellent point for optimization, but caution should be taken throughout the scanning and analysis process to ensure artifacts are not being induced. As X-rays pass through (and are attenuated by) varying amounts of bone tissue at each angular increment, this can lead to inaccuracy in the resulting data.
A second point for troubleshooting occurs in the reconstruction of scans. Depending on the program used, the user may define the density window for reconstruction, calculated from the attenuation coefficient values of the scan. This is a linear coefficient used to represent how much the X-ray beam is attenuated by an object5. Some programs obtain these densities and convert them to grayscale values ranging from 0 to 255 in a histogram. Two values will be shown in a histogram and are known as contrast limits10, which must be set appropriately to allow for recognition of bone versus non-bone voxels. The lower value will normally be set to a grey scale value of zero while the higher contrast value is recommended to be set to 10-20% of the maximum attenuation of the material of interest (bone)10. Errors may occur in analysis if this value is not set appropriately as portions of data could be cut off. Consequently, this value needs to be adjusted depending on the sample that was scanned. The best practice is to view the histogram on a logarithmic scale and select the upper limit as a number that is slightly larger than the end of the logarithmic tail to ensure all bone data are included10.
Additional points for troubleshooting occur during mechanical testing and analysis. During testing and viewing force-displacement curves after the test, samples that rolled may be spotted and should be removed from the dataset as discussed in the representative results. Moreover, caution should be taken regarding the orientation of the bone as there are assumptions governing the shape of the bone being tested. When using Euler-Bernoulli bending equations to calculate stress and strain, it is assumed the sample has a uniform cross section along its length3. Given that most bones do not have a uniform cross section, it is best to select the most uniform region of the bone to test (across the entire support span for three-point bending or between the loading points for four-point bending).
In a femur, testing in three-point bending at the midshaft is preferred. Because of the gentle curvature of the bone, one should test in the direction of the curvature to prevent local surface buckling (i.e., testing the femur with the anterior surface in tension). The tibia has a more variable cross-sectional shape, so the ideal region to test begins just proximal to the junction of the tibia and fibula. If the bone is placed with the medial surface in tension, the region of bone being tested is flat and has the least variability in radius and moment of inertia in the direction of bending. Caution should also be taken when interpreting the results of the bending tests due to the assumption that the material is isotropic, homogenous, and linearly elastic – each of these assumptions is violated to some degree when testing bone3. Bone's inability to fit these assumptions leads to results from bending tests that must be interpreted with caution. The properties that must be interpreted most carefully are those derived from the stress-strain curve past the yield point since by definition, passing yield violates the linear elastic assumption. While normalization to the bone's shape may be done, it is not recommended to attempt to normalize to animal body weight unless there are gross differences across groups. In this case, an analysis of covariance may be performed to compensate for these differences but general normalization to body weight should be avoided in most cases.
Despite the troubleshooting issues that may occur throughout this process, bending tests yield mechanical properties that can describe a bone's susceptibility to fracture. These tests are also relatively simple and quick to perform. Although absolute values from these tests might not always be completely valid, being able to detect relative differences between groups can be quite accurate for samples of various sizes and shapes. The mechanical properties obtained provide functional relevance in studies where differences in bone might be expected. Although monotonic failure tests are the most common and easily accessible mechanical test, other methods including fatigue life and fracture toughness may reveal additional mechanical properties of interest and could be considered.
The authors have nothing to disclose.
The work done to develop this protocol has been supported by the National Institutes of Health [AR072609].
CTAn | Bruker | NA | CT Scan Analysis Software |
DataViewer | Bruker | NA | CT Scan Rotation Software |
Matrix Laboratory (MATLAB) 2023a | MathWorks | NA | Coding platform used for data analysis |
NRecon | Bruker | NA | CT Scan Reconstruction software |
SKYSCAN 1272-100 kV w/ 16 MP CCD detector, incl 3D Suite Software | Micro Photonics Inc | SKY-016814 | Micro-CT system that can non-destructively visualize up to 209 mPs in every virtual slice through an object |