How do I use ClassMarker ?

 
Introduction
How to Proceed

Introduction

    ClassMarker operates on expression data measured on a number of genes in samples of different classes of cells. A class may be a particular tissue (e.g. brain, muscle, tumor), a particular disease status (e.g. normal, diseased, a particular stage of a disease), a particular disease (e.g. various types of cancer), etc. The membership of each sample in its class must be known with as high a degree of confidence as possible, for instance on the basis of histology, the known evolution of the disease, or other means of diagnosis.
    The data supplied to ClassMarker thus form a matrix where each row corresponds to a gene and each column to a sample. An element in the matrix contains the expression level of the row’s gene measured in the column’s sample. Each of the columns is furthermore classified into one of the sample classes. In order to be effective, the method used in ClassMarker needs data from as many samples in each class as possible. It is worth noting that the assignment of samples to classes is easily specified with the rich graphical interface available in ClassMarker (learn more on class creation and experiment reordering).

     Provided enough samples of known class are supplied, unclassified samples (samples of unknown class) may be supplied as well, in form of additional columns. ClassMarker will attempt to classify those samples into one of the known classes.
    When the data are read in, various filters can be specified (e.g. min/max expression level, minimal fold change, logarithmic transformation). This may modify the expression values taken into account, or eliminate genes from the analysis. The impact of the filters is conveniently monitored by the graphical interface.

    Once the filters are specified, the proper analysis can start. Two types of analysis are available:

  • identification of marker genes and assessment of their quality by cross-validation
  • identification of marker genes and assessment of their quality by train-and-test evaluation.

    In both cases, marker genes are identified on the basis of a subset of the classified genes and used to classify the rest of the genes – the quality of the markers is then assessed by the success in classifying each sample into its proper class. For the latter stage of classification, two classification techniques are available:

  • a proprietary voting method
  • a k-Nearest-Neighbors classification.

    While the choice of the classification technique depends on the data, the first is usually the best. When it is used, it is possible to take into account couples of classes in identification of the markers and the subsequent classification.
    ClassMarker offers a unique graphical interface that allows for a deep analysis of the data. Individual samples can be excluded from the analysis, classes can be merged and split, etc., with unparalleled ease. This enables the scientist to test many hypotheses and identify promising target genes in record time.

 

How to Proceed

1) Load your data with one of the following choices:

2) Create your classes

  • Specify filter parameters if necessary
  • Reorder your data if necessary.
  • Create the different classes.

3) Start an analysis

4) Analyze your result

  • Switch to the freshly created Result Tab.
  • Use the different views in the Result Tab to analyse the result.

5) Add other analyses

  • Add other analyses if necessary, by repeating the sequence from step 3. Various scenarii are easily obtained by

    Note that modifying the Master Tab in order to run an analysis with a different scenario does not invalidate the results obtained with previous scenarii, since all the necessary information is stored with the result in each Result Tab. This allows for a very easy comparison of results obtained with various scenarii by simple switching from one Result Tab to another.

6) Save your solution

  • Use the Save button of the toolbar or the Save command in the file menu to save your project as an AMP file. An AMP file contains all the details of a ClassMarker run, when it is subsequently reloaded into ArrayMiner, ClassMarker is set into the state in which the file was saved.