如何在 R 中查找 data.table 对象列中每个类别的百分比?
r programmingserver side programmingprogramming更新于 2025/4/7 10:37:17
要在 R 中查找 data.table 对象中每个类别的百分比,我们可以按照以下步骤操作 −
首先,创建一个 data.table 对象。
然后,在将 n 和 nrow 分组后,使用 dplyr 包的 summarise 函数。
示例
创建 data.table 对象
让我们创建一个 data.table 对象,如下所示 −
library(data.table) Factor<-sample(c("very low","low","medium","high","very high"),25,replace=TRUE) Response<-rnorm(25) DT<-data.table(Factor,Response) DT
输出
执行时,上述脚本将生成以下输出(由于随机化,此输出将因您的系统而异) −
Factor Response 1: low -1.61215323 2: very high -0.44482842 3: very low -0.08886876 4: very high -0.23859749 5: high 0.24527395 6: low 1.36309052 7: low 0.85434533 8: low -0.04598830 9: high -0.09693464 10: low -0.66470019 11: high -0.27726887 12: medium 0.30070368 13: very low -1.10935270 14: high 2.05570820 15: low -0.61390253 16: low 1.04354500 17: high -1.74834030 18: very low -0.84208037 19: medium 2.26732484 20: very high -0.30309612 21: low 0.72720852 22: high -2.52034038 23: very low -1.50381843 24: high -0.61927487 25: high 0.14854389 Factor Response
查找 data.table 对象中每个类别的百分比
使用 dplyr 包的 summarise 函数,在分组后结合 n 和 nrow 查找 data.table 对象 DT 的 Factor 列中每个类别的百分比 −
library(data.table) Factor<-sample(c("very low","low","medium","high","very high"),25,replace=TRUE) Response<-rnorm(25) DT<-data.table(Factor,Response) library(dplyr) DT %>% group_by(Factor) %>% summarise(Percent=n()/nrow(.))
输出
# A tibble: 5 x 2 Factor Percent <chr> <dbl> 1 high 0.32 2 low 0.32 3 medium 0.08 4 very high 0.12 5 very low 0.16