merging tables with different structures

Question

I have two tables where I want to find the outer join based on a Ticker variable. In Table I, I have only one Ticker for each entity (fund), but in table II, I may have multiple records (multiple Ticker) for each &#8220;FundID&#8221;. The goal is to count the unique funds. I want to have table III, which is t…

Accepted Answer

You can first filter df2 for rows where for each FundID, none of their corresponding &#8220;Ticker&#8221; is in df1['Ticket']. Then among these FundIDs, sample one Ticker for each FundID and concatenate this to df1:sub_df2 = df2[~df2['Ticker'].isin(df1['Ticket']).groupby(df2['FundID']).cummax()]out = pd.concat((df1, sub_df2.groupby('FundID')['Ticker'].sample(n=1).to_frame().rename(columns={'Ticker':'Ticket'})))Output:  Ticket0      A1      B2      C3      D5      E

Ticker	FundID
A	1
AA	1
AB	1
B	2
BB	2
E	3
EB	3
EC	3

Advertisement

Answer