Calculate means in a data frame based on positions from different columns

Time: 2015-04-21 09:08:40

Tags: r dataframe mean

My data frame is set up as follows:

N1 <- c(1,2,4,3,2,3,4,5,4,3,4,5,4,5,6,8,9)
Start <- c("","Start","","","","","","","Start","","","","Start","","","","")
Stop <- c("","","","","Stop","","","","","","Stop","","","","Stop","","")

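N1 is the data I'm interested in. I want to calculate the mean of each run of numbers, where the runs are defined by the "Start" and "Stop" positions in the next two columns.

The runs defined by "Start" and "Stop" look like this: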

2,4,3,2 
4,3,4
4,5,6

So my final result should be 3 means:

    2.75, 3.67, 5

3 Answers:

Answer 0: (score: 5)

You could try:

mapply(function(start, stop){
          mean(N1[start:stop])
       }, 
       start=which(Start!=""), 
       stop=which(Stop!=""))

#[1] 2.750000 3.666667 5.000000
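
The same idea can be wrapped in a small function that checks the markers before averaging. This is only a sketch: the name `run_means` is hypothetical (not from the answer), and it assumes every "Start" has a matching "Stop" at or after it, in the same order.

```r
N1 <- c(1,2,4,3,2,3,4,5,4,3,4,5,4,5,6,8,9)
Start <- c("","Start","","","","","","","Start","","","","Start","","","","")
Stop <- c("","","","","Stop","","","","","","Stop","","","","Stop","","")

# Hypothetical helper: mean of values[s:e] for each paired Start/Stop position
run_means <- function(values, start_flag, stop_flag) {
  s <- which(start_flag != "")
  e <- which(stop_flag != "")
  # Assumption: markers come in matched pairs, each Stop at or after its Start
  stopifnot(length(s) == length(e), all(s <= e))
  # vapply guarantees a numeric vector with one mean per run
  vapply(seq_along(s), function(i) mean(values[s[i]:e[i]]), numeric(1))
}

means <- run_means(N1, Start, Stop)
```

The `stopifnot` call makes unbalanced markers fail loudly instead of silently recycling arguments, which `mapply` would otherwise do when one vector's length is a multiple of the other's.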

Answer 1: (score: 4)

library(data.table) # needs 1.9.5+ for rleid

# build an indicator column: 1 for rows inside a Start..Stop run (inclusive), 0 otherwise
d = data.table(N1, event = cumsum((Start != "") - c(0, head(Stop != "", -1))))

d[, mean(N1), by = .(event, rleid(event))][event == 1, V1]
#[1] 2.750000 3.666667 5.000000

# or equivalently
d[, .(event[1], mean(N1)), by = rleid(event)][V1 == 1, V2]
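
To see what the `event` column contains, the same indicator can be built and inspected in base R. This is a sketch for illustration only: `rle` plus `rep` stands in for `rleid`, and the names `run_id` and `means` are my own.

```r
N1 <- c(1,2,4,3,2,3,4,5,4,3,4,5,4,5,6,8,9)
Start <- c("","Start","","","","","","","Start","","","","Start","","","","")
Stop <- c("","","","","Stop","","","","","","Stop","","","","Stop","","")

# +1 at each Start, -1 on the row after each Stop, so the running sum
# is 1 from Start through Stop (inclusive) and 0 elsewhere
event <- cumsum((Start != "") - c(0, head(Stop != "", -1)))
# event: 0 1 1 1 1 0 0 0 1 1 1 0 1 1 1 0 0

# Label consecutive runs of equal values (base-R stand-in for rleid)
r <- rle(event)
run_id <- rep(seq_along(r$lengths), r$lengths)

# Average N1 within each run where event == 1
means <- tapply(N1[event == 1], run_id[event == 1], mean)
```

One caveat with this construction: if a "Start" sits on the row immediately after a "Stop", the +1 and -1 cancel, `event` stays at 1, and the two runs are averaged together as one.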

Answer 2: (score: 2)

You could also try rollapply:

library(zoo)
x <- sort(c(which(Stop != ""), which(Start != ""))) # indices of Start and Stop
rollapply(x, 2, FUN = function(y) mean(N1[y[1]:y[2]]), by=2)
#[1] 2.750000 3.666667 5.000000
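
What `rollapply` does here, with a width of 2 and `by=2`, is walk over non-overlapping (start, stop) index pairs. A rough base-R equivalent, as a sketch that assumes "Start" and "Stop" strictly alternate so that sorting yields clean pairs (`pairs` and `means` are illustrative names):

```r
N1 <- c(1,2,4,3,2,3,4,5,4,3,4,5,4,5,6,8,9)
Start <- c("","Start","","","","","","","Start","","","","Start","","","","")
Stop <- c("","","","","Stop","","","","","","Stop","","","","Stop","","")

# Sorted marker positions: start1, stop1, start2, stop2, ...
x <- sort(c(which(Stop != ""), which(Start != "")))
# x: 2 5 9 11 13 15

# One row per run: column 1 is the start index, column 2 the stop index
pairs <- matrix(x, ncol = 2, byrow = TRUE)

# Mean of N1 over each start:stop range
means <- apply(pairs, 1, function(p) mean(N1[p[1]:p[2]]))
```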