calcSpm               package:KCsmart               R Documentation

_K_C_s_m_a_r_t _w_r_a_p_p_e_r

_D_e_s_c_r_i_p_t_i_o_n:

     Wrapper function that calculates the sample point matrix from the
     aCGH data

_U_s_a_g_e:

     calcSpm(data, mirrorLocs, sigma = 1e+06, sampleDensity = 50000, maxmem = 1000)

_A_r_g_u_m_e_n_t_s:

    data: The aCGH data. Can either be in DNAcopy format or as a
          data.frame described in the details section 

mirrorLocs: List containing the chromosome start, centromere and end
          positions 

   sigma: The kernel width 

sampleDensity: The sample point matrix resolution 

  maxmem: This parameter controls memory usage, set to lower value to
          lower memory consumption 

_D_e_t_a_i_l_s:

     'data' can be in cghRaw (CGHbase), DNAcopy or in data.frame
     format. When using the latter, the data.frame must have the
     following two columns: 'chrom' stating the chromosome the probe is
     located on, 'maploc' describing the position on the chromosome of
     the probe. The remainder of the data.frame will be interpreted as
     sample data points. The row names of that data will be used as
     probe names (when available). Important note: the data can not
     contain any missing values. If your data includes missing values
     you will need to preprocess (for example impute) it using other
     software solutions.

     The mirror locations for Homo Sapiens and Mus Musculus are
     provided in the package. These can be loaded using
     data(hsMirrorLocs) and data(mmMirrorLocs)  respectively. The
     'mirrorLocs' object is a list with vectors containing the start,
     centromere (optional) and end of each chromosome as the list
     elements. Additionally it should  contain an attribute
     'chromNames' listing the chromosome names of each respective list
     element.

     'sigma' defines the kernel width of the kernel used to convolute
     the data.

     'sampleDensity' defines the resolution of the sample point matrix
     to be calculated. A sampleDensity of 50000 would correspond to a
     sample point every 50k base pairs.

_V_a_l_u_e:

     Returns a sample point matrix object. The object has several slots
     of which the 'data' slot contains a list where each list item
     represents a chromosome. Each list item in turn contains the
     sample point matrix for the gains and the losses separately and an
     attribute specifying the corresponding chromosome. The sample
     point matrix contains the following additional slots: totalLength:
     Total length of the sample point matrix maxy and miny: Maximal and
     minimal score attained

     The other slots just represent the parameters used to calculate
     the sample point matrix.

     Use 'plot' to plot the sample point matrix and 'findSigLevelTrad'
     to find a significance threshold. 'plotScaleSpace' can be used to
     plot the significant regions of multiple sample point matrices
     (using different sigmas).

_A_u_t_h_o_r(_s):

     Jorma de Ronde

_S_e_e _A_l_s_o:

     'plot',  'findSigLevelTrad', 'plotScaleSpace'

_E_x_a_m_p_l_e_s:

     data(hsSampleData)
     data(hsMirrorLocs)

     spm1mb <- calcSpm(hsSampleData, hsMirrorLocs)
     spm4mb <- calcSpm(hsSampleData, hsMirrorLocs, sigma=4000000)

     plot(spm1mb)
     plot(spm1mb, chromosomes=c(1,5,6,'X'))

