$$\rightleftharpoonup{xx}$$
$$\longleftharp{xx}$$,
$$\longrightharp{xx}$$,
This procedure, which was recently published in Nature Protocols1 has been cited over 3,000 times in the archival literature and has become a routine measurement for lignin and tannin characterization since it provides essential, rapid, and reproducible structural information.
Lignin and tannins
When Green Chemistry was introduced by Paul T. Anastas and John C. Werner2,3, it drastically changed the general conception of Chemistry. In particular, the importance of employing sustainable materials instead of fossil feedstocks, such as oil and coal, as a starting point is highlighted as a crucial aspect2,3. Among the different types of biomass, lignin is the most abundant aromatic biopolymer and can be seen as a potential source for industrial commodities and high-added value products4.
Lignin is the second most abundant wood-constituent (with cellulose being first and hemicellulose third). Its content in plants varies depending on the plant-type: for example hardwoods characterized by a lower amount of lignin compared to softwoods (20% ± 4% vs. 28% ± 4%). In addition, lignin distribution within vegetable tissue is not homogeneous: the higher lignin content can be found in the cell wall5,6. Lignin is a polyphenolic material industrially obtained as a by-product of the paper/cellulose industry7. It is recovered from the wood pulping process, in which wood chips are primarily processed in the presence of OH- and/or OH- + HS- ion conditions to separate cellulose from hemicellulose and lignin (Soda and/or Kraft processes)8,9.
The first attempts to study lignin were made by Payen and Schultze, respectively, in 1838 and 186510. In 1977, Adler summarized all the relevant available knowledge of that time11. It is currently recognized that the lignin building blocks are three phenyl-propanoid units: p-coumaryl, coniferyl, and sinapyl alcohols. These monomers, thanks to a free radical polymerization process, give rise to p-hydroxyphenyl, guaiacyl, and sinapyl units that eventually broadly constitute lignin (Figure 1)12. The lack of a primary structure in lignins implies an inherent difficulty for its structural characterization. Accordingly, the evaluation of the distribution of molecular weight has always been somewhat controversial. Milled wood lignin, the lignin isolated under mild conditions that approximate mostly protolignin10, is composed of oligomers13 which highly interact via supramolecular aggregation processes14,15.

Figure 1: A representative model of softwood lignin in which the different types of bonds are highlighted. Please click here to view a larger version of this figure.
Lignins are commonly classified depending on: (a) the type of wood from which they are derived (e.g., hardwood and softwood), (b) the process used to isolate it. The most crucial industrial lignin types are Kraft, Lignosulfonates, and Organosolv.
The structure of lignin is highly dependent upon its origin and processing chemistry. More specifically, when the rather complex and irregular structure of lignin is compounded with its natural diversity and the complex processing chemistries, a material of extreme variability, diversity, and heterogeneity emerges, limiting its use to low-value applications16. While softwood lignins contain mainly guaiacyl units (G) with negligible amounts of p-hydroxyphenyl groups (G lignin), hardwood lignins are composed by guaiacyl and syringyl subunits (GS lignin) in varying ratios and grass lignins are constituted by guaiacyl, syringyl, and p-hydroxyphenyl (GSH lignin) subunits. The extractive approach used for isolation dramatically affects the structure of the emerging lignin17. Figure 2 depicts three lignin structures, differing by the isolation approach employed. Some considerations regarding the effect of the extraction method could be highlighted. Firstly, Kraft lignin is a dealkylated, highly fragmented, and condensed lignin, while Organosolv lignin has a structure similar to milled wood lignin (isolated using the Bjorkman approach)18,19,20. Finally, lignosulfonates are characterized by a high degree of sulfonation, depending on the intensity and the conditions of the extractive sulfonation process.

Figure 2: Representative structures for technical lignins. In this figure, the differences among the different types of lignin can be seen. (A) Softwood Kraft lignin is highly condensed, (B) lignosulfonates are characterized by sulfonic groups on saturated carbons, and (C) organosolv lignin has a structure similar to the one of milled wood lignin. Please click here to view a larger version of this figure.
Similar to lignins, tannins are polyphenolic compounds that are found in plants. A recent and updated review on tannins' extractive approaches and applications was recently released by Das et al.21. The importance of tannins in everyday life can be highlighted considering two examples: they impart taste and color to wines22; furthermore their poly-phenolic structure offers antioxidant characteristics and makes them ideal for application in the tanning industry23. Tannins are divided into two classes: hydrolyzable and non-hydrolyzable. Hydrolyzable tannins can be considered a polymer of gallic, di-gallic, and ellagic acid esters (Figure 3). These esters result from the esterification of the phenolic acids with sugar molecules (e.g., glucose, rhamnose, and arabinose).

Figure 3: Typical hydrolysable tannins: tannic acid, vescalgin. Please click here to view a larger version of this figure.
Non-hydrolyzable tannins, also known as condensed tannins, are polymers and oligomers deriving from flavan-3-ols. Among flavan-3-ols, catechins and gallocatechin are the most frequent. They are colorless crystalline compounds (Figure 4). The polymerization creates a polymer characterized by a helicoidal structure. The aromatic hydroxy groups are directed on the exterior of the helix, while the pyran oxygens are in the interior.

Figure 4: Proantocyanidin structures: R =H, OH, OCH3. Please click here to view a larger version of this figure.
Characterization of lignins and tannins using NMR
Two types of information are crucial in lignin or tannin characterization: (a) chemical structure (e.g., hydroxy group content, nature, and frequency of interunit linkages) and (b) molecular weight and polydispersity. Since the early studies on lignin, different techniques have been employed to achieve these goals, and two classes of methods have emerged: chemical and physical methods.
In lignin chemistry, chemical methods, such as alkaline nitrobenzene oxidation, derivatization followed by reductive cleavage, permanganate oxidation, and thioacidolysis, have been historically widely used24,25,26,27,28,29. However, even if the analytical protocols have been implemented and optimized, they are time-demanding, laborious, and require extensive experimental skills30. Alternatively, from the beginning of the instrumental analysis, physical methods have been used to perform lignin and tannin characterizations31. These techniques allow overcoming the problems of classical methods making it easy to characterize lignin structure.
Nuclear Magnetic Resonance (NMR) allows obtaining information about lignin structure and chemical composition among the instrumental techniques. In particular, data from quantitative monodimensional 1H NMR spectra and quantitative 13C NMR spectra can provide information about different types of lignin interunit bondings32,33,34,35. Unfortunately, monodimensional spectra suffer from signal overlap, which can seriously undermine signal integration efforts. Quantitative versions of HSQC (Heteronuclear Single Quantum Coherence), Q-HSQC (Quantitative - Heteronuclear Single Quantum Coherence), have been used to understand lignin structure better, providing helpful information about internal linkages. However, they cannot be fully utilized to determine the various buildings units13,36,37 quantitatively.
To overcome the issues associated with mono- and two-dimensional NMR, substrate derivatization has been considered. Among the advantages of this approach is that specific labels can be introduced within the complex macromolecule and no spectral interference results from the solvent in which the labeled substrates are dissolved1. Verkade was the pioneer in this field, performing 31P NMR analysis of phosphorous derivatives, coal derivatives, and related compounds38. In its publication, a screening of different phosphorus-containing reagents (phospholanes) was performed, and the chemical shift of other labeled compounds was recorded. Argyropoulos' team first introduced derivatization for the quantitative and qualitative analysis of hydroxy groups in lignin in 1991. After studying the derivatization of lignin model compounds using phosphorus-containing reagents, his group paved the way for one of the most daily-used techniques in lignin chemistry, 31P NMR analysis39,40,41,42,43. Among the different phospholanes examined, Argyropoulos arrived at the use of 2-chloro-4,4,5,5-tetramethyl-1,3-2-dioxaphospholane (TMDP) as being the most suitable one to perform lignin analysis44. TMDP selectively reacts with hydroxy groups causing the quantitative formation of phosphorus-containing derivatives characterized by specific 31P NMR chemical shifts (Figure 5).

Figure 5: Lignin and tannin phosphytilation chemistry. Labeling lignin and tannin labile H groups is accomplished by in situ reaction. The labeled polyphenols are characterized by specific 31P NMR bands corresponding to the different type of hydroxy groups. Please click here to view a larger version of this figure.
Sample derivatization is performed in a pyridine/chloroform (1.6:1) mixture; this choice results from an accurate evaluation. Pyridine has two advantages. Firstly, selecting a solvent characterized by a Hildebrand parameter of about 22.1 MPa1/2 simplifies and amplifies lignin solubilization45. Consequently, the addition of pyridine as a solvent, whose Hildebrand parameter equals 21.7, is thus optimal. Secondly, the reaction of TMDP with hydroxy groups is accompanied by the formation of hydrochloric acid (HCl) as a by-product with concomitant negative implications toward the facile formation of lignin-phospholane derivatives. For this reason, the resulting HCl needs to be neutralized. When present in significant excess, the basicity of the pyridine, relative to TMDP, allows for the neutralization of the HCl (via the formation of pyridine hydrochloride).
The use of the recommended pyridine/deuterated chloroform binary solvent system is based on three reasons. Firstly, it favors sample dissolution. Secondly, as pyridine hydrochloride is soluble in chloroform, it can prevent precipitation and deterioration of the final spectrum. Thirdly, deuterated chloroform is chosen for its unique singlet signal, allowing locking of the NMR spectrometer during the acquisition process. Sample derivatization is performed in the presence of an internal standard. In this way, when the sample and the standard are derivatized, the comparison of the integrals of the peaks of the sample and the standard allows the quantification of the amount for each type of hydroxy group present. Various compounds have been considered as internal standards. These compounds are characterized by a single hydroxy group per molecule, offering a single sharp signal in the 31P NMR spectrum after derivatization. The selection of the standard must be made carefully. Its signal should not overlap with those of the derivatized sample. Cholesterol was widely used during the early days. However, a partial overlap with signals arising from aliphatic hydroxy group limits its use. For routine analysis, internal standard solutions of N-hydroxy-5-norbornene-2,3-dicarboximide (NHND) are preferred. However, owing to NHND instability, its standard solutions can be stored only for a few days46.