I have a dataframe like the following with over 90000 rows.
origin     destination  people
101011001  101011001      7378
101011001  101011002       120
101011001  101011002         8
101011001  101011002       285
101011001  101011003         7
101011001  101011004         0
101011001  101011004         1
101011001  101011004         2
101011001  101011004         9
101011002  101011001         5
As you can see, some origin and destination values repeat; for example, there are multiple rows where origin=101011001 and destination=101011002.
My goal is to group the repeating origin and destination pairs and sum the people column, so the dataframe looks like this:
origin     destination  people
101011001  101011001      7378
101011001  101011002       413
101011001  101011003         7
101011001  101011004        12
101011002  101011001         5
I’ve tried jsondf.groupby(['origin', 'destination']).sum(), which gives me the correct sums and destination values, but it’s not quite what I want: I want the origin value to also be shown in the row for each destination.
Note: my end goal is to load this dataframe into a SQL database as a table, and with the .groupby() code above the origin and destination values are actually interpreted as NULL, which is not what I want.
Thanks!
Answer
A quick and easy way to get each of your origin values to display is to simply reset the index after the groupby. Here is an example showing what the dataframe looks like before and after resetting the index:
df.groupby(['origin', 'destination']).sum()

                       people
origin     destination
101011001  101011001     7378
           101011002      413
           101011003        7
           101011004       12
101011002  101011001        5
Once you add reset_index(), the dataframe has each value of origin represented in every row:
df.groupby(['origin', 'destination']).sum().reset_index()

      origin  destination  people
0  101011001    101011001    7378
1  101011001    101011002     413
2  101011001    101011003       7
3  101011001    101011004      12
4  101011002    101011001       5
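Equivalently, you can pass as_index=False to groupby so the grouping keys stay as ordinary columns and no separate reset_index() call is needed. A small sketch with made-up sample rows:

```python
import pandas as pd

# A few sample rows standing in for the question's dataframe.
df = pd.DataFrame({
    "origin":      [101011001, 101011001, 101011001, 101011002],
    "destination": [101011002, 101011002, 101011003, 101011001],
    "people":      [120, 293, 7, 5],
})

# as_index=False leaves origin and destination as regular columns.
out = df.groupby(["origin", "destination"], as_index=False)["people"].sum()
print(out)
#       origin  destination  people
# 0  101011001    101011002     413
# 1  101011001    101011003       7
# 2  101011002    101011001       5
```

Both approaches produce the same flat dataframe; as_index=False just does it in one step.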
This should allow you to send the dataframe to the SQL database without the origin values being interpreted as NULL.
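To illustrate the whole pipeline end to end, here is a minimal sketch that groups the sample data, resets the index, and writes the result to an in-memory SQLite database with to_sql. The table name "flows" is hypothetical; substitute your own connection and table name:

```python
import sqlite3
import pandas as pd

# Sample data mirroring the question's dataframe.
df = pd.DataFrame({
    "origin":      [101011001] * 9 + [101011002],
    "destination": [101011001, 101011002, 101011002, 101011002,
                    101011003, 101011004, 101011004, 101011004,
                    101011004, 101011001],
    "people":      [7378, 120, 8, 285, 7, 0, 1, 2, 9, 5],
})

# Collapse repeated origin/destination pairs, keeping them as columns.
grouped = df.groupby(["origin", "destination"])["people"].sum().reset_index()

# Write to SQLite; index=False keeps the pandas index out of the table,
# so origin and destination land as real columns, not NULLs.
with sqlite3.connect(":memory:") as conn:
    grouped.to_sql("flows", conn, index=False, if_exists="replace")
    back = pd.read_sql("SELECT * FROM flows", conn)
print(back)
```

Passing index=False to to_sql is the key detail for the NULL issue: without resetting the index first, origin and destination live in the index rather than in columns, and dropping the index on write discards them.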