Stephen J. Herschkorn
science forum Guru

Joined: 24 Mar 2005
Posts: 641

Posted: Wed Jul 19, 2006 3:56 pm    Post subject: Re: modelling method for categorical data

In sci.math, Arby wrote:

 Quote: Hello all,

 This is my first post to this list, my question is actually about statistics although I'm no statistician so if I don't get the correct terminology then forgive me but I'll try. What I want to know is, what would be the right type of statistical modelling for nominal/categorical data? Put another way what is the appropriate method for predicting a nominal response variable from a number of categorical explanatory variables?

 I am actually a biologist and what I have is a dataset that records gains and losses of regions of DNA from 400 colorectal cancer patients. More specifically, the whole human genome is divided into 862 segments, each segment is scored as 0 = no change, -1 = loss, 1 =gain, 2 = high level amplification, for each of the 400 patients.

 In general terms my question would be: can I predict the status of 1 of the 862 segments if I know the status of the other 861 segments? That would be what I think is called the maximal model, what is the process for reducing this to the minimal number of segments necessary to predict a given segment with a set level of certainty? Having looked through some basic stats books the nearest thing I've found is ANOVA but this confused as it talked about calculating means and I don't see how you can have a mean of a nominal variable (this may be the crux of my problem).

 I'm not looking for complete solutions, I just don't have the correct vocabulary to describe what I need so I'd be grateful if someone could tell me what the appropriate terms are and hopefully provide some beginners references so I can go away and learn how to do it.

 regards, Richard

This post is more germane to the newsgroup sci.stat, where I have

--
Stephen J. Herschkorn sjherschko@netscape.net
Math Tutor on the Internet and in Central New Jersey and Manhattan
Arby
science forum beginner

Joined: 19 Jul 2006
Posts: 1

Posted: Wed Jul 19, 2006 12:09 pm    Post subject: modelling method for categorical data

Hello all,

This is my first post to this list. My question is actually about
statistics, although I'm no statistician, so forgive me if I don't get
the terminology right. What I want to know is: what would be the right
type of statistical modelling for nominal/categorical data? Put another
way, what is the appropriate method for predicting a nominal response
variable from a number of categorical explanatory variables?

I am actually a biologist, and what I have is a dataset that records
gains and losses of regions of DNA from 400 colorectal cancer patients.
More specifically, the whole human genome is divided into 862 segments,
and each segment is scored for each of the 400 patients as 0 = no
change, -1 = loss, 1 = gain, or 2 = high-level amplification.
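
As a rough illustration of the data layout described above, here is a
minimal Python sketch. The values are random placeholders, not real
patient data, and the names (STATUS, make_fake_data, cross_tab) are
made up for this example. It treats the four codes as labels rather
than numbers, which is why counting pairs of statuses makes sense where
averaging them would not.

```python
import random
from collections import Counter

# The four codes from the post, used as category labels, not as quantities.
STATUS = {-1: "loss", 0: "no change", 1: "gain", 2: "high-level amplification"}

N_PATIENTS = 400
N_SEGMENTS = 862


def make_fake_data(seed=0):
    """Random placeholder data: one row per patient, one code per segment."""
    rng = random.Random(seed)
    codes = list(STATUS)
    return [[rng.choice(codes) for _ in range(N_SEGMENTS)]
            for _ in range(N_PATIENTS)]


def cross_tab(data, seg_a, seg_b):
    """Count how often each pair of statuses occurs for two segments."""
    return Counter((row[seg_a], row[seg_b]) for row in data)


data = make_fake_data()
table = cross_tab(data, 0, 1)  # contingency table for segments 0 and 1
for (a, b), count in sorted(table.items()):
    print(f"{STATUS[a]:>25} / {STATUS[b]:<25} {count}")
```

A table of counts like this (a contingency table) is the usual
starting point for categorical data, in place of the means that ANOVA
works with.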

In general terms my question would be: can I predict the status of one
of the 862 segments if I know the status of the other 861 segments? I
think that would be what is called the maximal model. What is the
process for reducing this to the minimal number of segments necessary
to predict a given segment with a set level of certainty? Having looked
through some basic stats books, the nearest thing I've found is ANOVA,
but this confused me, as it talked about calculating means and I don't
see how you can have a mean of a nominal variable (this may be the crux
of my problem).
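
To make the question concrete, here is a hedged sketch of one possible
first step: scoring every other segment by its Pearson chi-squared
association with the target segment and ranking them. This is only a
pairwise screening device chosen for illustration, not a full
predictive model and not necessarily the right answer to the question.
The usual names for the modelling task itself are multinomial logistic
regression and variable selection. The data are synthetic (20 segments
rather than 862, with segment 3 deliberately built to track segment 0),
and the function names are hypothetical.

```python
import random
from collections import Counter


def chi_squared(xs, ys):
    """Pearson chi-squared statistic for two equal-length lists of labels."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    row = Counter(xs)
    col = Counter(ys)
    stat = 0.0
    for x in row:
        for y in col:
            expected = row[x] * col[y] / n  # count expected under independence
            observed = joint.get((x, y), 0)
            stat += (observed - expected) ** 2 / expected
    return stat


def rank_predictors(data, target):
    """Rank every other segment by its association with the target segment."""
    target_col = [row[target] for row in data]
    scores = []
    for j in range(len(data[0])):
        if j == target:
            continue
        scores.append((chi_squared([row[j] for row in data], target_col), j))
    return sorted(scores, reverse=True)


# Synthetic data: 400 patients, 20 segments, codes as in the post.
rng = random.Random(1)
codes = [-1, 0, 1, 2]
data = []
for _ in range(400):
    row = [rng.choice(codes) for _ in range(20)]
    if rng.random() < 0.8:
        row[3] = row[0]  # assumption for the demo: segment 3 tracks segment 0
    data.append(row)

ranking = rank_predictors(data, target=0)
print(ranking[:3])  # strongest pairwise associations with segment 0
```

With 861 candidate segments, some large scores will turn up by chance
alone, so a ranking like this would need a correction for multiple
comparisons before anything is read into it.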

I'm not looking for complete solutions; I just don't have the correct
vocabulary to describe what I need, so I'd be grateful if someone could
tell me what the appropriate terms are and, hopefully, provide some
beginners' references so I can go away and learn how to do it.

regards,
Richard
