
Merging pandas DataFrames generated with a loop on SQL Database Data

This works, BUT the outputs are not matched on the index (Date). Instead, each new column's data starts after the previous dataframe's last row, i.e. the data is stacked “on top” of each other, so the Date index is repeated. Is there a way to iterate and create columns that are matched by Date?

import pandas as pd

# conn is assumed to be an open DB connection created earlier
indexReturnData = pd.DataFrame()
indexcount = int(input("How many indices do you want to use? Enter a quantity: "))

i = 0
while i < indexcount:
    # Pull the instrument list and keep only the name/ID columns
    indexList = pd.read_sql_query('SELECT * FROM Instruments', conn)
    indexList = indexList[['InstrumentName', 'InstrumentID']]

    # Case-insensitive partial match on the instrument name
    indexListFind = input('Enter partial index name: ')
    indexList = indexList[indexList['InstrumentName'].str.contains(indexListFind, case=False)]
    # TODO: add an if/else here in case the filter matches nothing
    print(indexList)

    indexID = input('Pick/Type in an INDEX ID ("InstrumentID") from the list above: ')
    indexName = list(indexList.query('InstrumentID == ' + indexID)['InstrumentName'])

    # NOTE: a parameterised query would be safer than string concatenation
    indexReturns = pd.read_sql_query(
        'SELECT * FROM InstrumentPrices WHERE InstrumentID = ' + indexID, conn)

    indexReturns = indexReturns.filter(['ReportingDate', 'Returns'])
    indexReturns = indexReturns.rename(columns={'ReportingDate': 'Date', 'Returns': indexName[0]})
    indexReturns = indexReturns.set_index('Date')

    # append() stacks the new frame's rows below the old ones (and is
    # removed in pandas 2.x) -- this is what repeats the Date index
    indexReturnData = indexReturnData.append(indexReturns)
    i += 1

Output:

         Date  S&P500  S&P600
308  9/1/1995   0.042
309 10/1/1995  -0.004
310 11/1/1995   0.044
311 12/1/1995   0.019
...       ...     ...     ...
603  4/1/2020   0.128
604  5/1/2020   0.048
605  6/1/2020   0.020
606  7/1/2020   0.056
623  9/1/1995           0.025
624 10/1/1995          -0.050
625 11/1/1995           0.038
626 12/1/1995           0.016
...       ...     ...     ...
918  4/1/2020           0.126
919  5/1/2020           0.041
920  6/1/2020           0.036
921  7/1/2020           0.040

Thanks!


Answer

Just based on what your current output is and what I think your desired output is, you can get away with a df.groupby('Date').sum(). Running that groups any duplicate values in the ‘Date’ column and sums the values it finds for each column. If I’m understanding right, each column only has a single real value per date (the rest are NaN, which sum() skips by default), so ‘summing’ just returns that number.
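
For example, here is a minimal self-contained sketch of what that does, with toy values standing in for your stacked frame:

import pandas as pd

# Toy stand-in for the stacked result: each date appears twice, once
# with only S&P500 filled and once with only S&P600 (the rest is NaN)
stacked = pd.DataFrame({
    'Date':   ['9/1/1995', '10/1/1995', '9/1/1995', '10/1/1995'],
    'S&P500': [0.042, -0.004, None, None],
    'S&P600': [None, None, 0.025, -0.050],
}).set_index('Date')

# 'Date' is the index name here, so groupby('Date') groups on the index;
# sum() skips the NaNs, leaving the one real value per date and column
merged = stacked.groupby('Date').sum()
print(merged)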

I copied the little output section you have above (and removed the blank rows) and just did df.groupby('Date').sum() and got this:

           S&P500  S&P600
Date
10/1/1995  -0.004  -0.050
11/1/1995   0.044   0.038
12/1/1995   0.019   0.016
4/1/2020    0.128   0.126
5/1/2020    0.048   0.041
6/1/2020    0.020   0.036
7/1/2020    0.056   0.040
9/1/1995    0.042   0.025
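
If you want to avoid the duplicated dates in the first place, one alternative (a sketch with toy frames, not tested against your database) is to collect each loop iteration's indexReturns in a list and join them column-wise with pd.concat(..., axis=1), which aligns rows on the shared Date index instead of stacking them:

import pandas as pd

# Toy stand-ins for the per-iteration indexReturns frames
sp500 = pd.DataFrame({'Date': ['9/1/1995', '10/1/1995'],
                      'S&P500': [0.042, -0.004]}).set_index('Date')
sp600 = pd.DataFrame({'Date': ['9/1/1995', '10/1/1995'],
                      'S&P600': [0.025, -0.050]}).set_index('Date')

# Inside the loop you would do frames.append(indexReturns) instead of
# indexReturnData.append(...); here the list is built directly
frames = [sp500, sp600]

# axis=1 matches rows by index (Date) and adds each frame as new columns
indexReturnData = pd.concat(frames, axis=1)
print(indexReturnData)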