以下代码计算每年的唯一ID数.我的问题是:如何统计new个唯一ID,即前几年没有出现的ID?
group_by(year) %>%
summarize(count=n_distinct(ID))
例如,我需要在下面创建变量WANG_COUNT
Year | ID | count | wanted_count |
---|---|---|---|
2000 | 1 | 3 | 3 |
2000 | 2 | 3 | 3 |
2000 | 3 | 3 | 3 |
2001 | 2 | 2 | 0 |
2001 | 3 | 2 | 0 |
2002 | 3 | 2 | 1 |
2002 | 4 | 2 | 1 |
2003 | 4 | 2 | 1 |
2003 | 7 | 2 | 1 |
2003 | 4 | 2 | 1 |
请参阅下面的数据:
structure(list(Year = c(2000L, 2000L, 2000L, 2001L, 2001L, 2002L,
2002L, 2003L, 2003L, 2003L), ID = c(1L, 2L, 3L, 2L, 3L, 3L, 4L,
4L, 7L, 4L), count = c(3L, 3L, 3L, 2L, 2L, 2L, 2L, 2L, 2L, 2L
), wanted_count = c(3L, 3L, 3L, 0L, 0L, 1L, 1L, 1L, 1L, 1L)), class = "data.frame", row.names = c(NA,
-10L))