+-------------+------+
| Column Name | Type |
+-------------+------+
| hall_id     | int  |
| start_day   | date |
| end_day     | date |
+-------------+------+
This table may contain duplicate rows.
Each row of this table indicates the start day and end day of an event and the hall in which the event is held.
Write a solution to merge all the overlapping events that are held in the same hall. Two events overlap if they have at least one day in common.
Input:
HallEvents table:
+---------+------------+------------+
| hall_id | start_day  | end_day    |
+---------+------------+------------+
| 1       | 2023-01-13 | 2023-01-14 |
| 1       | 2023-01-14 | 2023-01-17 |
| 1       | 2023-01-18 | 2023-01-25 |
| 2       | 2022-12-09 | 2022-12-23 |
| 2       | 2022-12-13 | 2022-12-17 |
| 3       | 2022-12-01 | 2023-01-30 |
+---------+------------+------------+
Output:
+---------+------------+------------+
| hall_id | start_day  | end_day    |
+---------+------------+------------+
| 1       | 2023-01-13 | 2023-01-17 |
| 1       | 2023-01-18 | 2023-01-25 |
| 2       | 2022-12-09 | 2022-12-23 |
| 3       | 2022-12-01 | 2023-01-30 |
+---------+------------+------------+
Explanation: There are three halls.
Hall 1:
- The two events ["2023-01-13","2023-01-14"] and ["2023-01-14","2023-01-17"] overlap. We merge them into one event ["2023-01-13","2023-01-17"].
- The event ["2023-01-18","2023-01-25"] does not overlap with any other event, so we leave it as it is.
Hall 2:
- The two events ["2022-12-09","2022-12-23"] and ["2022-12-13","2022-12-17"] overlap. We merge them into one event ["2022-12-09","2022-12-23"].
Hall 3:
- The hall has only one event, so we return it.
Note that we only consider the events of each hall separately.
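To merge overlapping events in the same hall, sort each hall's events by start day and use window functions to group overlapping intervals. An event starts a new group when its start day is later than the latest end day among all earlier events in that hall. Comparing only against the immediately preceding event is not enough, because a long earlier event can fully contain several later ones. A running sum of these "new group" flags gives each event a group number, and each merged event is the minimum start day and maximum end day of its group.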
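WITH ordered AS (
    -- For each event, find the latest end day among earlier events in the same hall.
    SELECT hall_id, start_day, end_day,
           MAX(end_day) OVER (
               PARTITION BY hall_id
               ORDER BY start_day, end_day
               ROWS BETWEEN UNBOUNDED PRECEDING AND 1 PRECEDING
           ) AS prev_max_end
    FROM HallEvents
),
grouped AS (
    -- Flag an event that starts after every earlier event has ended, then take a
    -- running sum of the flags. The default frame includes peer rows, so exact
    -- duplicate rows always receive the same group number.
    SELECT hall_id, start_day, end_day,
           SUM(CASE WHEN start_day > prev_max_end THEN 1 ELSE 0 END) OVER (
               PARTITION BY hall_id
               ORDER BY start_day, end_day
           ) AS grp
    FROM ordered
)
SELECT hall_id, MIN(start_day) AS start_day, MAX(end_day) AS end_day
FROM grouped
GROUP BY hall_id, grp;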
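The grouping rule can also be cross-checked outside the database. The following is a minimal Python sketch, not part of the original solution: it sorts each hall's events by start day and starts a new merged event only when an event begins after the latest end day seen so far. The function name `merge_hall_events` and the variable `sample` are hypothetical, and rows are assumed to arrive as `(hall_id, start_day, end_day)` tuples holding `datetime.date` values.

```python
from collections import defaultdict
from datetime import date


def merge_hall_events(events):
    """Merge overlapping (hall_id, start_day, end_day) tuples, hall by hall."""
    by_hall = defaultdict(list)
    for hall_id, start_day, end_day in events:
        by_hall[hall_id].append((start_day, end_day))

    merged = []
    for hall_id in sorted(by_hall):
        cur_start, cur_end = None, None
        # Sorting by (start_day, end_day) mirrors the ORDER BY in the query.
        for start_day, end_day in sorted(by_hall[hall_id]):
            if cur_start is None:
                cur_start, cur_end = start_day, end_day
            elif start_day > cur_end:
                # No day in common with anything seen so far: close the group.
                merged.append((hall_id, cur_start, cur_end))
                cur_start, cur_end = start_day, end_day
            else:
                # Overlap: extend the group to the latest end day seen so far.
                cur_end = max(cur_end, end_day)
        merged.append((hall_id, cur_start, cur_end))
    return merged


# The rows from the example input above.
sample = [
    (1, date(2023, 1, 13), date(2023, 1, 14)),
    (1, date(2023, 1, 14), date(2023, 1, 17)),
    (1, date(2023, 1, 18), date(2023, 1, 25)),
    (2, date(2022, 12, 9), date(2022, 12, 23)),
    (2, date(2022, 12, 13), date(2022, 12, 17)),
    (3, date(2022, 12, 1), date(2023, 1, 30)),
]
result = merge_hall_events(sample)
```

Tracking the running maximum end day, rather than only the previous event's end day, is what keeps a short event nested inside a long one from splitting the group. Duplicate rows need no special handling here because a duplicate always overlaps its twin.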