University of Bath, Bath, UK

MYH11 (myosin heavy polypeptide 11) and NDE1 (Nude1) are transcribed from opposite strands of human chromosome 16 (see Figure

Transcription of MYH11 and NDE1

Microarrays continue to generate complex gene expression data. Clustering of both genes and samples is one of the most common analytical objectives, and it is often achieved by spectral analysis of a matrix associated with the bipartite graph formed by the genes, the samples and the links between them. Specifically, we first represent the activity of the i-th gene in the j-th sample as a positive value w_ij and store these values in a rectangular matrix W. Genes and samples may then be clustered simultaneously using the singular value decomposition (SVD) of W, with the singular vectors corresponding to the second largest singular value providing the information needed to implement the clustering. These clustering techniques are heuristic, and it is natural to ask how reliable they are. Using techniques from numerical linear algebra and probability analysis, it is possible to derive a sensitivity measure for the robustness of SVD-based clustering. We use this sensitivity analysis to provide an answer to the above question about the expression of MYH11 and NDE1.
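The SVD-based clustering step described above can be illustrated with a minimal sketch. It is not the authors' implementation. The function name `svd_bicluster`, the degree normalisation of W (a common choice in spectral partitioning of bipartite graphs) and the split by sign into two groups are all assumptions made for illustration. The sensitivity analysis itself is not shown.

```python
import numpy as np

def svd_bicluster(W):
    """Split genes (rows) and samples (columns) of W into two groups each.

    Hypothetical helper: W is assumed to hold positive activity values w_ij.
    """
    W = np.asarray(W, dtype=float)

    # Assumed normalisation: scale each entry by the square roots of its
    # row and column sums, as is common for bipartite spectral partitioning.
    d_rows = W.sum(axis=1)
    d_cols = W.sum(axis=0)
    Wn = W / np.sqrt(np.outer(d_rows, d_cols))

    # Singular values are returned in descending order.
    U, s, Vt = np.linalg.svd(Wn, full_matrices=False)

    # Singular vectors for the second largest singular value, rescaled back.
    u2 = U[:, 1] / np.sqrt(d_rows)
    v2 = Vt[1, :] / np.sqrt(d_cols)

    # Assumed rule: the sign of each entry decides the cluster (0 or 1).
    gene_labels = (u2 > 0).astype(int)
    sample_labels = (v2 > 0).astype(int)
    return gene_labels, sample_labels, s
```

Because the left and right singular vectors are paired, a gene and a sample that receive the same label belong to the same co-cluster. Which group is called 0 and which is called 1 is arbitrary, since a singular vector is only defined up to sign.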

The advent of microarrays covering all exons opens new possibilities for identifying alternative transcripts and changes in the composition of mRNA and proteins. With these possibilities comes the challenge of reliably identifying candidates for alternative splicing and, where possible, suggesting "clusters" of co-expressed exons which can then be tested in the laboratory. The mathematical techniques used in the above work can help in this process.