R – get a vector that tells me if a value of another vector is the first appearance or not

Question

I have a data frame of sales with three columns: the customer code, the month in which the customer bought the item, and the year. A customer can buy something in September and then make another purchase in December, and so appear twice. But I'm interested in the absolutely new customers by month and year. So I have thought

Accepted Answer

I created some dummy data myself with id, month (in numeric format), and year.

    library(dplyr)

    dat <- data.frame(
      id    = c(1, 2, 3, 4, 5, 6, 7, 8, 1, 3, 4, 5, 1, 2, 2),
      month = c(1, 6, 7, 8, 2, 3, 4, 8, 11, 1, 10, 9, 1, 12, 2),
      year  = c(2019, 2019, 2019, 2019, 2019, 2020, 2020, 2020,
                2020, 2020, 2021, 2021, 2021, 2021, 2021)
    )

       id month year
    1   1     1 2019
    2   2     6 2019
    3   3     7 2019
    4   4     8 2019
    5   5     2 2019
    6   6     3 2020
    7   7     4 2020
    8   8     8 2020
    9   1    11 2020
    10  3     1 2020
    11  4    10 2021
    12  5     9 2021
    13  1     1 2021
    14  2    12 2021
    15  2     2 2021

Then group by id and arrange by year and month (the order matters, because the first row of each group must be the earliest purchase). Then use filter and row_number() to keep only that first row per customer.

    dat %>%
      group_by(id) %>%
      arrange(year, month) %>%
      filter(row_number() == 1)

         id month  year
    1     1     1  2019
    2     5     2  2019
    3     2     6  2019
    4     3     7  2019
    5     4     8  2019
    6     6     3  2020
    7     7     4  2020
    8     8     8  2020
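If you want a logical vector flagging first appearances (as the title asks) rather than a filtered data frame, one possible sketch in base R is to sort chronologically and negate duplicated(). This is an alternative to the dplyr approach in the answer, not part of it. It uses the same dummy data, and the names is_new and new_per_month are made up here for illustration.

```r
# Same dummy data as in the answer.
dat <- data.frame(
  id    = c(1, 2, 3, 4, 5, 6, 7, 8, 1, 3, 4, 5, 1, 2, 2),
  month = c(1, 6, 7, 8, 2, 3, 4, 8, 11, 1, 10, 9, 1, 12, 2),
  year  = c(2019, 2019, 2019, 2019, 2019, 2020, 2020, 2020,
            2020, 2020, 2021, 2021, 2021, 2021, 2021)
)

# Sort chronologically so that "first" means the earliest purchase.
dat <- dat[order(dat$year, dat$month), ]

# duplicated() is FALSE the first time an id is seen and TRUE afterwards,
# so negating it flags each customer's first appearance.
# (is_new is a hypothetical column name.)
dat$is_new <- !duplicated(dat$id)

# Count new customers per year and month
# (sum() of a logical vector counts the TRUEs).
new_per_month <- aggregate(is_new ~ year + month, data = dat, FUN = sum)
```

dat[dat$is_new, ] should give the same rows as the filter(row_number() == 1) approach, while keeping the flag lets you count both new and returning purchases from the same data frame.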

Answer