Tag: pandas

Finding unique number of IDs in multiple groups

I have a dataset that has doctors and the various practices they work in. Each doctor in my dataset works in at least 1 practice but as many as 17 different practices. I would like to know the unique …
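
Since the excerpt is cut off, the exact count being asked for is unknown. As a minimal sketch, assuming one row per doctor/practice pairing and hypothetical column names `doctor_id` and `practice_id`, distinct IDs per group can be counted with `groupby` and `nunique`:

```python
import pandas as pd

# Hypothetical data: one row per (doctor, practice) pairing.
# The column names are assumptions, not taken from the question.
df = pd.DataFrame({
    "doctor_id": [1, 1, 2, 3, 3, 3],
    "practice_id": ["A", "B", "A", "A", "B", "B"],
})

# Number of distinct doctors in each practice.
doctors_per_practice = df.groupby("practice_id")["doctor_id"].nunique()

# Number of distinct practices each doctor works in.
practices_per_doctor = df.groupby("doctor_id")["practice_id"].nunique()

print(doctors_per_practice)
print(practices_per_doctor)
```

`nunique` ignores repeated pairings, so the duplicated (3, "B") row is counted once.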

Python problem with adding table to Database with sql

pandas python sql sqlalchemy

Hi, I’m trying to create a function that will take a table and insert (add) it into the database. My code: This is the old code from before I created a function: Answer: You seem to have put table_name in quotation marks in your to_sql call. Also, you seem to be tackling the issue of data model creation and maintenance, which already …
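
The original code is not shown, so the following is only a sketch of the quoting issue the answer describes. It assumes an in-memory SQLite database in place of the asker's real one, and the function name `add_table` and the table name `customers` are made up for illustration:

```python
import sqlite3

import pandas as pd


def add_table(df, table_name, conn):
    """Write df to the database under the name held in table_name."""
    # Pass the variable itself. Writing "table_name" in quotes would
    # create a table that is literally called table_name.
    df.to_sql(table_name, conn, if_exists="replace", index=False)


# Assumption: in-memory SQLite stands in for the real database.
conn = sqlite3.connect(":memory:")
add_table(pd.DataFrame({"id": [1, 2], "name": ["a", "b"]}), "customers", conn)

# List the tables that now exist in the database.
tables = pd.read_sql_query(
    "SELECT name FROM sqlite_master WHERE type = 'table'", conn
)["name"].tolist()
print(tables)
```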

Pandas dataframe combine unique row values

dataframe pandas pandas-groupby python sql

I have a dataframe like the following with over 90000 rows. As you can see, some origin and destination values repeat; for example, there are multiple rows where origin=101011001, destination=101011002. My goal is to group the repeating origin and destination values and sum the people column, so the dataframe looks like this: I’ve tried jsondf.groupby(['origin', 'destination']).sum() which gives me …
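
The output the asker got is cut off, so what went wrong is not known. As a sketch with made-up rows (only the pair origin=101011001, destination=101011002 comes from the question), one way to get a flat dataframe with one row per pair is to select the `people` column and pass `as_index=False`:

```python
import pandas as pd

# Made-up sample rows; only the first origin/destination pair
# appears in the question.
jsondf = pd.DataFrame({
    "origin": [101011001, 101011001, 101011001, 101011002],
    "destination": [101011002, 101011002, 101011003, 101011001],
    "people": [3, 4, 1, 2],
})

# as_index=False keeps origin and destination as ordinary columns
# instead of moving them into a MultiIndex.
summed = jsondf.groupby(["origin", "destination"], as_index=False)["people"].sum()
print(summed)
```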

Selective summation of columns in a pandas dataframe

pandas python sql

The COVID-19 tracking project (API described here) provides data on many aspects of the pandemic. Each row of the JSON is one day’s data for one state. As many people know, the pandemic is hitting different states differently: New York and its neighbors hardest first, with other states being hit later. Here is a subset of the data: To …

Unable to use ‘read_sql’ to call a SQL query class

pandas pyodbc python-3.x sql

I am trying to pull results from the database with the following code: I get an error: I saw a similar question at TypeError: ‘pyodbc.Cursor’ object is not callable (Python 3.6) but was unable to get an answer from there. Answer: I got it to work by editing the class from into

Moving all rows with a certain index into a single row

pandas python sql sql-server

I have a table with a structure like the following, with an unknown number of rows for each group index. Group || PropertyA || PropertyB || PropertyC ============================================ …

Trying to insert a python input command into my SQL query

input pandas python sql

So this is what I’m trying to do: I just want to know if there’s a way to link the date input command to my SQL query so that anyone who runs the code can enter a date and get the info for only that specific date. (super-duper noob here) Answer: To get the date for the input to commend …
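
The asker's query and the full answer are not shown. As a sketch of one common approach, a user-supplied date can be passed to the query as a bound parameter instead of being pasted into the SQL string. The `sales` table, its columns and the function `rows_for_date` are hypothetical, and SQLite stands in for the real database (the placeholder syntax differs between drivers):

```python
import sqlite3

import pandas as pd

# Assumption: a small in-memory table stands in for the real data.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (sale_date TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO sales VALUES (?, ?)",
    [("2020-05-01", 10.0), ("2020-05-02", 20.0), ("2020-05-02", 5.0)],
)


def rows_for_date(conn, date_str):
    """Return the rows whose sale_date equals date_str."""
    # The ? placeholder lets the driver bind the value safely.
    query = "SELECT sale_date, amount FROM sales WHERE sale_date = ?"
    return pd.read_sql_query(query, conn, params=(date_str,))


# In an interactive script the date could instead come from:
#   date_str = input("Enter a date (YYYY-MM-DD): ")
result = rows_for_date(conn, "2020-05-02")
print(result)
```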

SQL & Pandas Efficiency [closed]

dataframe pandas python sql sql-server

Closed. This question is opinion-based. It is not currently accepting answers. Closed 2 years ago. Quick question. What is the rule of thumb when deciding where to begin manipulating data? Should I do it when I

Python Pandas non equal join

analytics pandas python self-join sql

Have table: OUT Need: So I need to make a non-equal join, equivalent to the SQL query: or as SQL query: The problem in pandas is that a non-equal self join cannot be done with merge, and I can’t find another way. Answer: We can solve this in pandas in a smarter way by using groupby with agg and joining the strings.
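
The tables, the SQL queries and the answer's groupby/agg code are not shown, so they cannot be reproduced here. As a different, generic technique for small tables, a non-equal self join can be emulated with a cross merge followed by a filter. The data and the `<` condition below are assumptions, and `how="cross"` needs pandas 1.2 or later:

```python
import pandas as pd

# Made-up table; the real one is not shown in the question.
df = pd.DataFrame({"id": [1, 2, 3], "value": [10, 20, 30]})

# Pair every row with every row (a Cartesian product) ...
pairs = df.merge(df, how="cross", suffixes=("_left", "_right"))

# ... then keep only the pairs that satisfy the non-equality condition.
non_equal = pairs[pairs["id_left"] < pairs["id_right"]].reset_index(drop=True)
print(non_equal)
```

The intermediate product has n² rows, so this only suits tables small enough to fit in memory.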

Calculate TimeDiff in Pandas based on a column values

pandas python sql

I have a dataframe like this: The desired result is aggregated IDs with the time differences between Start and End, looking like this: I tried simple groupings and diffs, but that does not work: How can this task be done in pandas? Thanks! Answer: A possible solution is to join the table on itself like this: Output:
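
The answer's code and output are missing from the excerpt. A minimal sketch of a self join along the lines it describes, assuming hypothetical columns `ID`, `Status` (holding "Start" or "End") and `Time`, with exactly one Start and one End per ID:

```python
import pandas as pd

# Made-up data; the column layout is an assumption.
df = pd.DataFrame({
    "ID": [1, 1, 2, 2],
    "Status": ["Start", "End", "Start", "End"],
    "Time": pd.to_datetime([
        "2020-01-01 10:00", "2020-01-01 10:30",
        "2020-01-01 11:00", "2020-01-01 12:15",
    ]),
})

# Split into start and end rows, then join them back together on ID.
starts = df[df["Status"] == "Start"]
ends = df[df["Status"] == "End"]
merged = starts.merge(ends, on="ID", suffixes=("_start", "_end"))

# Subtracting two datetime columns yields a timedelta column.
merged["TimeDiff"] = merged["Time_end"] - merged["Time_start"]
result = merged[["ID", "TimeDiff"]]
print(result)
```

If an ID can have several Start/End pairs, the merge would match every Start with every End for that ID, so the pairs would need an extra key to line up.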