Skip to content

WestCoastGod/HongKong-Stargazing-Analysis

Repository files navigation

Hong Kong Stargazing Analysis Project

Introduction

Stargazing in Hong Kong is challenged by severe light pollution, making it difficult to observe celestial objects except in rural areas under optimal conditions. This project aims to use data analysis and machine learning skills to find the correlations between stargazing levels and weather conditions and build the foundation for helping users find the best times and locations for stargazing with less manual effort.

Methodology

  • Data Sources:

  • Data Processing:

    • Aggregated minute-level sky brightness data to daily values to match daily weather data.
    • Applied corrections for city light using VIIRS satellite data and the natural night sky brightness value from the Hong Kong Space Museum Astropark in the GaN-MN dataset.
    • Final dataset: 936 rows, 25 features (e.g., cloud amount, humidity, wind speed, moon phase).
  • Analysis:

    • Explored correlations and trends among features.
    • Built regression models (Linear Regression, Random Forest) to predict daily night sky brightness.
    • Evaluated models using Mean Squared Error (MSE), R-squared, and K-fold cross-validation.
    • Assessed feature importance to identify key influencing factors.

Results

  • Model Performance:
    • Random Forest outperformed Linear Regression (lower MSE, higher R²).
  • Key Findings:
    • Most important negative factors: cloud amount, moon phase, wind speed, sunshine hours, relative humidity.
    • Best stargazing: clear skies, no moon, calm winds, short sunshine hours in the day, low humidity.
  • Limitations:
    • Some uncertainty remains due to city light and model simplicity.

Future Work

Develop a real-time tool to synthesize weather data and output a stargazing Bortle scale for Hong Kong.

Supplementary

  1. Link to SEEM2460 Project Proposal
  2. Link to SEEM2460 Project Report
  3. Links to Pre-processed Raw Data:
  4. Link to Processed Data:
    • filtered_data.zip: Include daily_nsb.csv, combined_sun, combined_moon, and nsb_weather_merged
    • sky_brightness.zip: Include every month's sky brightness data from January 2022 to October 2024
    • weather.zip: Include different kinds of weather data
  5. The dataset contains features from 1-1-2022 to 31-10-2024
  6. nsb_weather_merged Dataset features:
Max Night Sky Brightness (MPSAS) Min Night Sky Brightness (Non-zero) (MPSAS) Mean Night Sky Brightness (Excluded zero) (MPSAS) Daily Mean Hong Kong Heat Index Daily Mean Wet Bulb Temperature (°C)
Daily Mean Dew Point Temperature (°C) Daily Mean Amount of Cloud Daily Mean Pressure (hPa) Daily Total Evaporation (mm) Daily Total Rainfall (mm)
Daily Mean Relative Humidity (%) Daily Maximum Temperature (°C) Daily Minimum Temperature (°C) Daily Mean Temperature (°C) Daily Global Solar Radiation (MJ/m2)
Daily Total Bright Sunshine (hours) Daily Mean Wind Speed (m/s) Sun Rise Sun Transit Sun Set
Moon Rise Moon Transit Moon Set