Giter Club home page Giter Club logo

hotel-business-analysis's Introduction

Hotel Business Analysis

hotelbusiness

It is very important for a company to always analyze its business performance. On this occasion, we will delve deeper into business in the hospitality sector. Our focus is to find out how our customers behave in making hotel reservations, and its relationship to the rate of cancellation of hotel reservations. We will present the results of the insights we find in the form of data visualization to make it easier to understand and more persuasive.

This project will analyze hotels based on customer behavior using data visualization.

Points to Analyze

  • Monthly Booking Analysis Based on Hotel Type
  • Impact Analysis of Stay Duration on Hotel Bookings Cancellation Rates
  • Impact Analysis of Stay Duration on Hotel Bookings Cancellation Rates

Data Overview

Feature Name Description
hotel Type of hotel
is_canceled Value indicating if the booking was canceled (1) or not (0)
lead_time Number of days that elapsed between the entering date of the booking into the PMS and the arrival date
arrival_date_year Year of arrival date
arrival_date_month Month of arrival date with 12 categories: “January” to “December”
arrival_date_week_number Week number of the arrival date
arrival_date_day_of_month Day of the month of the arrival date
stays_in_weekend_nights Number of weekend nights (Saturday or Sunday) the guest stayed or booked to stay at the hotel
stays_in_weekdays_nights Number of weeknights (Monday to Friday) the guest stayed or booked to stay at the hotel BO and BL/Calculated by counting the number of weeknights
adults Number of adults
children Number of children
babies Number of babies
meal Meal menu
city City of origin
market_segment Market segment designation. In categories, the term “TA” means “Travel Agents” and “TO” means “Tour Operators”
distribution_channel Booking distribution channel. The term “TA” means “Travel Agents” and “TO” means “Tour Operators”
is_repeated_guest Value indicating if the booking name was from a repeated guest (1) or not (0)
previous_cancellations Number of previous bookings that were canceled by the customer prior to the current booking
previous_bookings_not_canceled Number of previous bookings not canceled by the customer prior to the current booking
booking_changes Number of changes/amendments made to the booking from the moment the booking was entered on the PMS until the moment of check-in or cancellation
deposit_type Type of deposit
agent ID of the travel agency that made the booking
company ID of the company/entity that made the booking or is responsible for paying the booking. ID is presented instead of designation for anonymity reasons
days_in_waiting_list Number of days the booking was in the waiting list before it was confirmed to the customer
customer_type TYpe of customer
adr Average Daily Rate (Calculated by dividing the sum of all lodging transactions by the total number of staying nights)
required_car_parking_spaces Number of car parking spaces required by the customer
total_of_special_requests Number of special requests made by the customer (e.g. twin bed or high floor)
reservation_status Status of reservation
  1. This hotel business has a large amount of customers,119,930 records with 29 features. But still, this dataset is raw and needs to be cleaned first before analysis.
  2. This dataset uses 2 types of hotels, they are City Hotel and Resort Hotel.
  3. There are 4 columns containing null values, the children, city, agent, and company features
  4. After exploring, the null and abnormal values will be analyzed more deeply to ensure our assumptions. We also need to check the unique value for categorical columns and other analysis
  5. In this case, company contains 94% null values, which means this column doesn’t give us important thing to know and lack of information. Delete this column is the best way. Furthermore, the duplicated rows and city have been deleted as well
  6. The missing and abnormal values will be replaced by appropriate values.

Data Analysis

1. Monthly Booking Analysis Based on Hotel Type

a.
rsvhotel
In this section, we’ll try to see the hotel booking trend based on hotel type, they are City Hotel and Resort Hotel.
The dataset tells us three types of reservation status after they make a reservation based on arrival date. The status is:

  • Check Out
  • Canceled
  • No Show

From the graph above, we can see that the City Hotel has more reservations including more check-outs, more canceled, and more no-shows than the Resort Hotel. The canceled reservations in city hotels are almost half of the total checkout so need to be analyzed further to avoid this in the future.

b.
monthlybooking

The graph shows the hotel booking trend from City Hotel and Resort Hotel for three years. Reservations seem to fluctuate but are increasing over time. The number of bookings from City Hotel began to exceed Resort Hotel in November 2017 and got more reservations over three years. We also can see the peak season around June to October but it is not the holiday season, and the majority of customers are not families. Therefore, it is likely that many customers are traveling for business purposes or reasons other than vacation.

2. Impact Analysis of Stay Duration on Hotel Bookings Cancellation Rates


a.
grup-stayduration

The next analysis is to see the cancellation rates based on the stay duration of customers. The stay duration starts from 1 to the longest 69 days. From the statistical data, the most stay duration has happened is around one to seven days (a week). The difference in stay duration seems very large, so it needs to be kept simpler by grouping the durations, they are :

  • Group 1 : 1-7 days
  • Group 2 : 8-14 days
  • Group 3 : 15-21 days
  • Group 4 : 22-30 days
  • Group 5 : More than 1 month

As we can see, the frequencies are too unequal, only group 1 is dominant.

b.
grup-stayduration-cancelrates
From the trends, some observations are : The least canceled bookings were on around 1-7 days. The good thing is that this is the group with the most frequent stays and the cancellation rate is also the smallest. Customers who stayed around 8-14 days have the highest cancellation rates. The cancellation rate for customers who stayed more than a month is also high, need to be aware of junk leads.

3. Impact Analysis of Stay Duration on Hotel Bookings Cancellation Rates


a.
grup-leadtime
As we can see from the analysis, the lead time range is way too large from 1 day to 2 years. Well, the 2 year lead-time seems quite nonsense and the customer counts are also few so we can group them as one group. Now, for a lead time of under a year, the most frequent is around 1 week which reaches thousands of customers.
So, we can group the lead time based on frequency and duration of lead time as follows :
a. 1-week (1-8 days)
b. 2-3 weeks (9-20 days)
c. 1-month (21-31 days)
d. 2 months
e. 3-4 months
f. 5-6 months
g. 7-9 months
h. 1 year (10-12 months)
i. >1 year

b.
grup-leadtime

No matter what type of hotel, the lead time really has a big impact on the cancellation rate.
For Resort Hotel, the cancellation rate seems quite stable around the thirties to seventies.
For City Hotel, the highest cancellation rate comes from 2 months of lead time (reaches 88.76%) and the lowest is a month (11.24%). It indicates that 1 month is an optimal lead time for city hotels.

Business Recommendation

From data analysis, we found several things that need to be improved. So, here are several business recommendations that can be made to improve the performance of the hotel business :

1. It is necessary to carry out further analysis of the cancellation rate of City Hotels.
2. Collaborate with local recreation or tourism parties to provide promotions for staying at hotels for customers who travel
3. For high cancellation rates, it is necessary to take precautions by setting a down payment for customers who will stay more than 1 week or even 1 month to prevent further profit losses.
4. Eliminate lead times of more than 4 months because basically, lead times of more than 1 month have a high cancellation rate. Suggest to customers to book a hotel about 1 or 2 weeks before staying. Then, if the distance between stays is still long, it is recommended to book about 1 month in advance. Then, if they book several months before their stay we need to do an analysis and ask why, because the customers could come from tourists or public figures or business teams who really have certain needs.

Thank You!

hotel-business-analysis's People

Contributors

yunanouv avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.