Output Format¶
The pipeline generates prediction files in the results/ directory.
Output Files¶
Two types of prediction files are generated:
File Pattern |
Description |
|---|---|
|
District-level predictions with threshold analysis |
|
State-level aggregated predictions |
Example filenames:
Predictions_Oct - Nov 2025_District_20251027.csvPredictions_Oct - Nov 2025_Karnataka_20251027.csv
District-Level Predictions¶
Column Specifications¶
Column Name |
Data Type |
Description |
|---|---|---|
|
String |
District identifier (e.g., “district_374”) |
|
Date |
Date of the prediction record (YYYY-MM-DD) |
|
Integer |
ISO week number of the year |
|
String |
Threshold calculation method: “historical” or “previousNweeks” |
|
Float |
Mean of historical cases for threshold calculation |
|
Float |
Standard deviation of historical cases |
|
Float |
Lower bound (typically 0.0) |
|
String |
Upper bound indicator |
|
Float |
Threshold tier 0 (baseline) |
|
Float |
Threshold tier 1 (elevated) |
|
Float |
Threshold tier 2 (high) |
|
Date |
Start date of the week being predicted |
|
Date |
Date when the prediction was computed |
|
String |
Region identifier matching the district |
|
Float |
Predicted number of dengue cases |
|
String |
Model used (typically “ensembleModel”) |
|
Float |
Zone classification based on threshold tiers |
State-Level Predictions¶
Column Specifications¶
Column Name |
Data Type |
Description |
|---|---|---|
|
Date |
Date when prediction was computed |
|
Date |
Start date of predicted week |
|
String |
State identifier (e.g., “state_29”) |
|
Float |
Predicted number of dengue cases for the state |
|
String |
Threshold calculation method used |
|
Integer |
Alert zone classification (0=low, higher=elevated risk) |
|
String |
Model used for prediction |
Data Characteristics¶
Temporal Resolution: Weekly predictions
Prediction Horizon: Multiple weeks ahead (typically 2-4 weeks)
Threshold Methods: Two methods for alert classification:
historical: Based on historical mean and standard deviationpreviousNweeks: Based on recent N weeks of data
Prediction Zones: Numerical classification indicating risk levels
Model Type: Ensemble model combining multiple forecasting approaches
Example Record¶
District-level:
district_374,2025-10-27,42,historical,4.5,0.707,0.0,inf,4.5,5.207,5.914,2025-10-27,2025-10-27,district_374,0.819,ensembleModel,1.0
State-level:
2025-10-27,2025-10-27,state_29,21.49,historical,0,ensembleModel