Austrian Hotels Dataset
Overview
A realistic simulated dataset of hotels across Austria for practicing data wrangling and table joins. Contains multiple related tables with hotels, cities, occupancy, tourism, and economic data.
Used in: Week 4 (Joining Tables)
Generated by: Claude AI (Sonnet 3.7) with realistic relationships between variables
Data Files
The dataset includes 8 related tables:
| File | Description | Rows |
|---|---|---|
hotels_modified.csv |
Basic hotel information | 200 |
cities_modified.csv |
City information | 10 |
monthly_occupancy_modified.csv |
Monthly hotel performance | ~3,800 |
city_tourism_modified.csv |
Monthly tourism stats | 240 |
economic_indicators.csv |
Monthly economic indicators | 24 |
reviews_modified.csv |
Hotel guest reviews | ~1,700 |
amenities.csv |
List of hotel amenities | 10 |
hotel_amenities_modified.csv |
Hotel-amenity relationships | ~1,000 |
Documentation
- hotel-data-readme.md - Detailed schema documentation
Code
Code examples to be added
Key Relationships
- One-to-One: Hotels to Cities (through city name)
- One-to-Many: Hotels to Monthly Occupancy, Hotels to Reviews
- Many-to-Many: Hotels to Amenities (through
hotel_amenities) - Composite Keys: Monthly data uses
(hotel_id, month, year)
Learning Objectives
- Inner, left, right, and full joins
- One-to-one and one-to-many relationships
- Composite key joins
- Data aggregation after joins
- Handling missing values in joins
Downloads
- data-modified.zip - All data files