ChaseDream
搜索
123下一页
返回列表 发新帖
查看: 4919|回复: 20
打印 上一主题 下一主题

[读书的日子] 求教大家关于R软件的

[精华] [复制链接]
跳转到指定楼层
楼主
发表于 2012-5-12 06:13:56 | 只看该作者 回帖奖励 |倒序浏览 |阅读模式
R软件很难学吗 本人没编程背景 刚接了一个project 要学R统计软件 大家有什么自学窍门吗
收藏收藏1 收藏收藏1
沙发
发表于 2012-5-12 06:24:30 | 只看该作者
你可能需要先告诉别人具体是用R做些什么分析。
板凳
 楼主| 发表于 2012-5-12 06:29:00 | 只看该作者
I am attaching 3 files here:  the input CSV file which we discussed today, the R code, and the PPT containing the variable distributions in the original population.  It would be great if the script could be set up such that the user inputs the following:



(1)    Range of acceptable values for each variable in each level (in our example, there are 8 levels)

(2)    # of samples we want to select from each level (in this case, 13)

(3)    The CSV file



And the script will produce as outputs:

(1)    The seed that produced the final set of 13 samples for each level

(2)    The list of selected sample IDs (in this case, 13) for each level.  In this case, the sample IDs are the ‘Sitesubj’ variable in the CSV file.

(3)    lots and tables showing the final variable values (all scaled in %) for each level



Something else I forgot to mention, which might be of help is that you can set the y-axis range for the boxplots using the command:  

boxplot(s1$AGE, ylim=c(10,77))



This helps when comparing the original boxplot to the sample boxplot, since you can set their y-axes to be the same.



Also, you had asked about the continuous variables (what to use for range).  I should have asked if you were familiar with boxplots, but here is a brief reference:



http://www.childrensmercy.org/stats/definitions/boxplot.htm



There are 5 numbers that comprise a boxplot—so you could pick some of them to use for setting your ranges.  I would suggest trying the median, mean, and box length, since these are the most important components
地板
 楼主| 发表于 2012-5-12 06:29:56 | 只看该作者
你可能需要先告诉别人具体是用R做些什么分析。
-- by 会员 猪之哥 (2012/5/12 6:24:30)


嗯 我把问题贴出来了 谢谢 我一点头绪都没有 教授让我先学R
5#
发表于 2012-5-12 09:35:49 | 只看该作者
去看一些Statistics的课,比如UC-Berkeley的公开课,就有教R的。

可以问问你教授,直接Stata或者SAS不行么?R学会了,如果不用也会忘掉的吧?
6#
发表于 2012-5-12 10:11:39 | 只看该作者
有教stata的公开课么?
7#
 楼主| 发表于 2012-5-12 10:22:33 | 只看该作者
去看一些Statistics的课,比如UC-Berkeley的公开课,就有教R的。

可以问问你教授,直接Stata或者SAS不行么?R学会了,如果不用也会忘掉的吧?
-- by 会员 摇一摇 (2012/5/12 9:35:49)

                   

不懂哎 这是一个医药公司的活 那人说要用R给他做 教授就让我自己学R 我都不知道从哪入手......
8#
发表于 2012-5-12 10:56:27 | 只看该作者
我以前学R的时候很喜欢的网站:
http://www.statmethods.net/
我觉得这个比R的官方网站清楚,上手快。
9#
发表于 2012-5-12 10:57:16 | 只看该作者
thanks for sharing
10#
发表于 2012-5-12 10:58:50 | 只看该作者
I am attaching 3 files here:  the input CSV file which we discussed today, the R code, and the PPT containing the variable distributions in the original population.  It would be great if the script could be set up such that the user inputs the following:



(1)    Range of acceptable values for each variable in each level (in our example, there are 8 levels)

(2)    # of samples we want to select from each level (in this case, 13)

(3)    The CSV file
这部分可见:http://www.statmethods.net/input/importingdata.html



And the script will produce as outputs:

(1)    The seed that produced the final set of 13 samples for each level

(2)    The list of selected sample IDs (in this case, 13) for each level.  In this case, the sample IDs are the ‘Sitesubj’ variable in the CSV file.

(3)    lots and tables showing the final variable values (all scaled in %) for each level
这部分可见:http://www.statmethods.net/management/index.html
http://www.statmethods.net/graphs/index.html



Something else I forgot to mention, which might be of help is that you can set the y-axis range for the boxplots using the command:  

boxplot(s1$AGE, ylim=c(10,77))



This helps when comparing the original boxplot to the sample boxplot, since you can set their y-axes to be the same.



Also, you had asked about the continuous variables (what to use for range).  I should have asked if you were familiar with boxplots, but here is a brief reference:



http://www.childrensmercy.org/stats/definitions/boxplot.htm



There are 5 numbers that comprise a boxplot—so you could pick some of them to use for setting your ranges.  I would suggest trying the median, mean, and box length, since these are the most important components
-- by 会员 阿尔瓦雷斯 (2012/5/12 6:29:00)

您需要登录后才可以回帖 登录 | 立即注册

Mark一下! 看一下! 顶楼主! 感谢分享! 快速回复:

近期活动

正在浏览此版块的会员 ()

手机版|ChaseDream|GMT+8, 2024-5-10 23:23
京公网安备11010202008513号 京ICP证101109号 京ICP备12012021号

ChaseDream 论坛

© 2003-2023 ChaseDream.com. All Rights Reserved.

返回顶部