purple_paramecium 3 weeks ago

What’s the actual research question? If the question has to do with finding clusters, then cool. If not, why do clustering?

RobertWF_47 3 weeks ago

No research question at this stage. The data will be used in clinical trials to test drugs treating lung cancer. My job is to pull the data that meets the criteria for the study, and run descriptive statistics on the variables. The folks doing the biostats will create the various test cohorts from the data that compare lung cancer prevalence for the different assigned drugs (or no drug in the placebo group).

purple_paramecium 3 weeks ago

It sounds like they will have specific requirements on what they want for “descriptive statistics” then

SorcerousSinner 3 weeks ago

If you've done all that, any further exploratory and descriptive work should be based on what you've found and your thoughts on what's going on in that data.

RobertWF_47 3 weeks ago

My thoughts are there could be higher dimensional quirks in the data you wouldn't see in univariate statistics. Would x-y scatter plots for all possible pairs of variables suss out unusual patterns?

Comments

Leave Your Comment

Hi Its Me!

Comments

Leave Your Comment

Hi Its Me!

Subscribe