I’ve been trying to solve an issue for the past couple of days, but couldn’t figure out what the solution would be…
I have a table as the following:
+--------+-----------+-------+ | ShopID | ArticleID | Price | +--------+-----------+-------+ | 1 | 3 | 150 | | 1 | 2 | 80 | | 3 | 3 | 100 | | 4 | 2 | 95 | +--------+-----------+-------+
And I woud like to select pairs of shop IDs for which the price of the same article is higher. F.e. this should look like:
+----------+----------+---------+ | ShopID_1 | ShopID_2 |ArticleID| +----------+----------+---------+ | 4 | 1 | 2 | | 1 | 3 | 3 | +----------+----------+---------+
… showing that Article 2 ist more expensive in ShopID 4 than in ShopID 2. Etc
My code so far looks as following:
SELECT ShopID AS ShopID_1, ShopID AS ShopID_2, ArticleID FROM table WHERE table.ArticleID=table.ArticleID and table.Price > table.Price
But it doesn’t give the result I am searching for.
Can anyone help me with this objective? Thank you very much.
Advertisement
Answer
The problem here is about calculating Top N items per Group.
Assuming you have the following data, in table sales
.
# select * from sales; shopid | articleid | price --------+-----------+------- 1 | 2 | 80 3 | 3 | 100 4 | 2 | 95 1 | 3 | 150 5 | 3 | 50
With the following query we can create a partition for each ArticleId
select ArticleID, ShopID, Price, row_number() over (partition by ArticleID order by Price desc) as Price_Rank from sales;
This will result:
articleid | shopid | price | price_rank -----------+--------+-------+------------ 2 | 4 | 95 | 1 2 | 1 | 80 | 2 3 | 1 | 150 | 1 3 | 3 | 100 | 2 3 | 5 | 50 | 3
Then we simply select Top 2 items for each AritcleId:
select ArticleID, ShopID, Price from ( select ArticleID, ShopID, Price, row_number() over (partition by ArticleID order by Price desc) as Price_Rank from sales) sales_rank where Price_Rank <= 2;
which will result:
articleid | shopid | price -----------+--------+------- 2 | 4 | 95 2 | 1 | 80 3 | 1 | 150 3 | 3 | 100
Finally, we can use crosstab
function to get the expected pivot view.
select * from crosstab( 'select ArticleID, ShopID, ShopID from ( select ArticleID, ShopID, Price, row_number() over (partition by ArticleID order by Price desc) as Price_Rank from sales) sales_rank where Price_Rank <= 2') AS sales_top_2("ArticleID" INT, "ShopID_1" INT, "ShopID_2" INT);
And the result:
ArticleID | ShopID_1 | ShopID_2 -----------+----------+---------- 2 | 4 | 1 3 | 1 | 3
Note:
You may need to call CREATE EXTENSION tablefunc;
in case if you get the error function crosstab(unknown) does not exist
.