我有一个数据框,如下所示.我想先按ID和区域对数据帧进行分组,然后按内容过滤数据帧.我想保留在内容列中同时包含"高"和"中"的ID区域对.

我当前的数据帧:

structure(list(ID = c("a", "a", "a", "a", "a", "b", "b", "c", 
"c", "c", "c", "d", "d", "d"), Content = c("High", "Mid", "Low", 
"High", "Mid", "High", "Mid", "High", "Mid", "High", "Mid", "High", 
"Mid", "Low"), Count = c("3", "6", "2", "3", "7", "3", "5", "8", 
"5", "4", "2", "4", "9", "5"), Region = c("East", "East", "East", 
"West", "West", "East", "East", "East", "East", "West", "West", 
"West", "West", "West")), class = "data.frame", row.names = c(NA, 
-14L))

我的expect数据帧:

structure(list(ID = c("a", "a", "b", "b", "c", "c", "c", "c"), 
    Content = c("High", "Mid", "High", "Mid", "High", "Mid", 
    "High", "Mid"), Count = c("3", "7", "3", "5", "8", "5", "4", 
    "2"), Region = c("West", "West", "East", "East", "East", 
    "East", "West", "West")), class = "data.frame", row.names = c(NA, 
-8L))

我try 了下面的代码,但我不能保留同时具有"High"和"Mid"的ID-Region对.

newdata = data %>%
    group_by(ID,Region) %>%
    filter(n() != 1 & any(Content == "High") & any(Content == "Mid") & any(Content != "Low") %>%
    ungroup

推荐答案

我们可以过滤以保留任何不包含任何Content == Low且必须包含HighMid的组.

library(dplyr)

df %>%
  group_by(ID,Region) %>%
  filter(!any(Content == "Low") & all(c("High", "Mid") %in% Content) )

Output

  ID    Content Count Region
  <chr> <chr>   <chr> <chr> 
1 a     High    3     West  
2 a     Mid     7     West  
3 b     High    3     East  
4 b     Mid     5     East  
5 c     High    8     East  
6 c     Mid     5     East  
7 c     High    4     West  
8 c     Mid     2     West 

R相关问答推荐

从有序数据中随机抽样

R创建一个数据透视表,计算多个组的百分比

ggplot geom_smooth()用于线性回归虚拟变量-没有回归线

terra nearest()仅为所有`to_id`列返回NA

用derrr在R中查找组间的重复项

Ggplot2中的重复注记

在rpart. plot或fancyRpartPlot中使用带有下标的希腊字母作为标签?

打印XTS对象

当我们有多个反斜杠和/特殊字符时使用Gsubing

仅在R中的数据集开始和结束时删除所有 Select 列的具有NA的行

如何删除R中除数字元素以外的所有元素

是否有可能从边界中找到一个点值?

自定义交互作用图的标签

如何在R中使用因子行求和?

通过比较来自多个数据框的值和R中的条件来添加新列

如何捕获这个shiny 的、可扩展的react 性用户输入矩阵作为另一个react 性对象,以便进一步操作?

如何在给定的环境中找到函数的函数参数?

如何将数据框压缩为更宽,同时将行输入保持为行输入,而不是R中的列名?

如何从R调用Amazon销售合作伙伴API?

使用离散标签自定义图例,用于具有连续但已入库的数据的热图