7
$\begingroup$

I am seeking some help and or perspectives in solving a problem.

I have a dataset (accessible here) with the following columns:

  • DATE: this is the date in dd/mm/yyyy format
  • HH: this is the "half-hour" window of the day. In a day there are 48 HH windows starting from 12 AM - 12:30 AM, 12:30 AM - 01:00 AM, and so on.
  • WIND_SPEED_MS: this is the forecasted wind speed in m/s
  • GENERATION_MW: this is the actual power generated by a windfarm in MW
  • SEASONAL_NORMAL_WIND_SPEED_KNOTS: this is the long-term, historically averaged, windspeed for a given time of day and day of year, in knots (1 nautical mile per hour)

The goal: To correlate wind power generation (MWh) to the forecasted wind speed (m/s), so that for a future wind speed forecast, I may predict the expected power output reliably.

Optional side questions:

  1. What explains the power generation at very high wind speeds (when in reality most turbines should all be stopped for safety reasons beyond 30-35 m/ s)?
  2. What explains really high levels of wind generation, far above the "ceiling" that can be observed in the scatter plot?
  3. Why is the seasonal normal wind speed in knots, when converted to m/s, so low. It doesn't seem right.

Your help would be greatly appreciated!

My approach: average generation data (MW) in wind speed bins of 0.5 m/s width. This gives me a smoother curve that I can interpolate using piecewise polynomial. Then I can look up what will be the power output for a given (future) forecast wind speed.

Any comments on this approach would also be appreciated.

$\endgroup$

1 Answer 1

6
$\begingroup$

You want to predict power output (in MW or MWh) based on forecasted wind speed (in m/s). Here’s a structured approach:

  1. Preprocessing
  • Convert DATE and HH into a timestamp for easier handling.
  • Convert SEASONAL_NORMAL_WIND_SPEED_KNOTS into m/s: 1 knot ≈ 0.51444m/s
  1. Visual Analysis
  • Plot GENERATION_MW vs. WIND_SPEED_MS in an interactive scatter plot, using Plotly. Expect a nonlinear "power curve" shape:
    • Low speeds → low generation
    • Moderate speeds → sharp increase
    • High speeds → plateau or drop
  1. Modeling
  • Use a regression model to learn the wind power curve. Try these baseline options: Polynomial Regression, Random Forest, Gradient Boosting (e.g., XGBoost), Support Vector Regression (SVR).
  • Do error analysis (RMSE, MAE by time of day or season).

You might also want to bin wind speeds and look at average generation per bin to get a smoothed empirical curve before fitting.

$\endgroup$

Your Answer

By clicking “Post Your Answer”, you agree to our terms of service and acknowledge you have read our privacy policy.

Start asking to get answers

Find the answer to your question by asking.

Ask question

Explore related questions

See similar questions with these tags.