$$\rightleftharpoonup{xx}$$
$$\longleftharp{xx}$$,
$$\longrightharp{xx}$$,
The value of transcriptomic studies lies primarily in the quality of the starting biological material. If the RNA extraction is performed in optimal conditions, the RNA Integrity Number (RIN) is typically 7 or greater (Figure 4A). The need to hybridize 2 µg of cDNA on the Affymetrix HERV-V2 chip implies the use of an amplification process. A successful amplification step leads to a bell-shaped distribution (Figure 4B). Then, DNAse1 fragmentation is performed in order to homogenize the cDNA size distribution around 100 nucleotides before hybridization (Figure 4C). After hybridization and scanning (Figure 4D), a visual inspection of the image enables one to check if the grid is well aligned to the spots (Figure 4E) and if hybridization controls are consistent (Figure 4F). This step is also useful in order to exclude microarrays in which air-bubbles or errors occurred during the experiment.
Once the chips have passed QC (Figure 5) and after normalization, the statistical analysis of 5 match-pair tumor and normal prostate RNA samples from the Lyon-Sud Hospital led to the identification of 207 HERV probesets with differential expression values (p.val <0.05) (Figure 6A). To support these records and to gain prostate-specific information, 35 additional match-pair samples (colon, ovary, testis, breast, lung and prostate) were added to the analysis and the SAM-FDR procedure (FDR = 20%) eventually identified 44 prostate specific HERV probesets. Among them, the most relevant 10 HERV structures are described (Figure 6B). Further clinical studies will be required to assess the values of sensitivity and specificity of these candidate biomarkers.

Figure 1. Scheme of the overall procedure from the clinic (1: prostatectomy by the clinician and the tissue preparation by the pathologist) to the bench (2-6: sample preparation, target preparation, microarray processing) leading to the identification of candidate biomarkers (7: biocomputing analysis of the HERV microarrays). Nucleic acids derived from normal tissue are depicted in orange; nucleic acids derived from tumoral area consist of a mix of normal (orange) and tumor specific (black) nucleic acids. Click here to view larger image.

Figure 2. Conception and content of the HERV-V2 chip: HERV sequences retrieved from the human genome are stored in a database called HERV-gDB3, then the 25-mer candidate probes pass through a dedicated hybridization modeling procedure (EDA+) before being eventually synthesized on the array (the resulting targeted sub-regions are depicted for each family). Click here to view larger image.

Figure 3. Prostate handling by the pathologist. (A) Fresh radical prostatectomy specimen is transferred to the laboratory. (B-C) The prostate is stained (green on the right side, black on the left side). (D) Large transverse section of the gland on the posterior side. (E) Leaving the margins intact, pieces of tissues are dissected from different areas of the prostate gland. (F) Cores of tissue are placed in an Eppendorf tube. (G) Suture thread is used to close the prostate and to prevent gland distortion and minimal disruption of the surgical margin. Then, the radical prostatectomy specimen is ready for fixing in formalin according to the usual procedure for histological analysis. Click here to view larger image.

Figure 4. Quality controls of nucleic acid preparation and hybridization efficiency. (A) RNA integrity, (B) cDNA amplified targets and (C) fragmented targets used in the hybridization stage. These three quality controls were obtained with the Bioanalyzer using RNA nano chips and the Eukaryote Nano Serie II assay. (D) Overall image of the HERV-V2 microarray hybridization area after scanning, (E) enlargement of the upper left corner showing grid alignment controls and (F) enlargement of the center area showing spotting hybridization controls. Click here to view larger image.

Figure 5. Processing of signals. (A) Affymetrix polyA spike-in amplification controls. The polyA controls Dap, Thr, Phe and Lys transcripts from B. subtilis genes are spiked in the RNA sample and serve to assess the overall success of the target preparation steps. Intensity should be detected at decreasing values among these spike-in controls to ensure that there was no bias during the WT-Ovation amplification between highly- and low-expressed genes. (B) Affymetrix spike-in hybridization controls. These targets isolated from E. coli and P1 bacteriophage are spiked before the labeling procedure. Increasing values from BioB, BioC, BioD and Cre indicate the overall success of the hybridization. (C) Intensity distribution of the chip signals after RMA normalization. Most of the probesets exhibit signals with values lower than 26 (background), indicating an overall expression mainly restricted to some specific HERV loci. Click here to view larger image.

Figure 6. Data analysis. (A) Hierarchical clustering analysis of normal and tumoral samples. Partitioning clustering was applied to the normalized expression values using a Euclidean distance function algorithm, grouping probesets into up (red)- and down (blue)-regulation among normal and tumoral samples. (B) Selection of the top 10 HERV structures identified as candidate biomarker of prostate cancer. For each HERV element, the related HERV family, the genomic coordinates (NCBI 36/hg18) and a brief description of the HERV structure are given. Click here to view larger image.

Figure 7. The HERV repertoire. (A) Sequencing of the human genome revealed 25,000 protein-coding genes (exons, 2%) and a huge amount of transposable elements including 200,000 long-terminal repeat (LTR) retrotransposons (HERV, 8%). (B) Extrapolation from HERV-V2 chip content and associated expression data (79 samples originating from 8 normal versus tumoral tissue types) suggest that one third of the HERV repertoire is transcriptionally active. Click here to view larger image.

Figure 8. Functional interpretation of signals from the chip. (A) Promoter identification and epigenetic control: U3 negative signal (red probe, 5'LTR) versus R-U5 positive signal (blue probe, 5'LTR) suggest U3-driven transcription, supported by the different CpG methylation (solid black circles) content of U3 in peritumoral normal versus tumoral tissues. (B) Splicing strategy: the putative 3.1 kb envelope encoding mRNA expressed exclusively in the tumor is identified using SD1/SA2 splice junction overlapping probe. *Deduced by the comparison with other non-placental tissues. Click here to view larger image.