• Pre-K provider data, including geographic data, licensure, and other quality indicators
    • Each school was linked to pre-K provider data by computing a distance-weighted sum of provider features over all providers within a 40-mile radius of the school
  • 203 total features, including the school's geographic location and enrollment data
  • District-level covariates from the Stanford Education Data Archive
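
The school-to-provider linkage described above can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the source does not say which distance measure or weighting function was used, so the haversine distance, the `1 / (1 + d)` weight, and all function and field names here are assumptions.

```python
import math

EARTH_RADIUS_MILES = 3958.8  # mean Earth radius


def haversine_miles(lat1, lon1, lat2, lon2):
    """Great-circle distance in miles between two points given in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2)
    return 2 * EARTH_RADIUS_MILES * math.asin(math.sqrt(a))


def inverse_distance_weight(d_miles):
    # ASSUMPTION: the weighting function is not given in the source;
    # 1 / (1 + d) is a hypothetical choice that stays finite at d = 0.
    return 1.0 / (1.0 + d_miles)


def weighted_provider_features(school, providers, radius_miles=40.0,
                               weight=inverse_distance_weight):
    """Distance-weighted sum of each provider feature within the radius."""
    totals = {}
    for p in providers:
        d = haversine_miles(school["lat"], school["lon"], p["lat"], p["lon"])
        if d > radius_miles:
            continue  # provider is outside the 40-mile radius
        w = weight(d)
        for name, value in p["features"].items():
            totals[name] = totals.get(name, 0.0) + w * value
    return totals


# Hypothetical data: one provider at the school's location, one about 120 miles north.
school = {"lat": 44.77, "lon": -122.52}
providers = [
    {"lat": 44.77, "lon": -122.52, "features": {"licensed": 1.0}},
    {"lat": 46.50, "lon": -122.52, "features": {"licensed": 1.0}},
]
result = weighted_provider_features(school, providers)
```

Here only the first provider falls inside the radius, so `result` is `{"licensed": 1.0}`. Passing a different `weight` function changes how quickly distant providers are discounted.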
Summary of features
Variable        Number Cases   Number Missing   Mean         SD        Levels
leaid           125338         0                4107012.91   3895.13   -
distid          125338         0                2121.43      192.42    -
year            125338         0                -            -         2014-15, 2015-16, 2016-17
lat             125338         0                44.77        1.02      -
lon             125338         0                -122.52      1.18      -
gender          125338         0                -            -         F, M
ethcode         125338         0                -            -         H, W, B, M, I, A, P
sch_type_text   125338         0                -            -         Regular School, Alternative Education School, Special Education School, Alternative School
recon_status    125338         0                -            -         No
charter_text    125338         0                -            -         No, Yes

Rows 1–10 of 203 shown.
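
A per-variable summary with these columns could be produced along the following lines. This is a sketch under assumptions: the `summarize` helper and the toy rows are hypothetical, "Number Cases" is taken to mean the total row count, and SD is computed as the sample standard deviation, none of which the source confirms.

```python
import statistics


def summarize(rows, variable):
    """Count, missing count, mean, SD, and levels for one column."""
    values = [r.get(variable) for r in rows]
    present = [v for v in values if v is not None]
    summary = {
        "variable": variable,
        "n_cases": len(values),                  # assumed: total rows
        "n_missing": len(values) - len(present),
        "mean": None,
        "sd": None,
        "levels": None,
    }
    if present and all(isinstance(v, (int, float)) for v in present):
        # Numeric column: report mean and sample SD.
        summary["mean"] = statistics.mean(present)
        summary["sd"] = statistics.stdev(present) if len(present) > 1 else None
    else:
        # Categorical column: report distinct levels in first-seen order.
        summary["levels"] = list(dict.fromkeys(present))
    return summary


# Hypothetical toy rows, not the real data.
rows = [
    {"lat": 44.0, "gender": "F"},
    {"lat": 46.0, "gender": "M"},
    {"lat": None, "gender": "F"},
]
lat_summary = summarize(rows, "lat")
gender_summary = summarize(rows, "gender")
```

Numeric columns get a mean and SD with no levels, while categorical columns get levels with no mean or SD, which matches the pattern of empty cells in the table.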

Pre-K Features
[Paged feature table: the captured page repeats rows 1–10 of 203 listed under "Summary of features".]

Test RMSE: 3.00    Test MAE: 2.39    Test R²: 0.11
Test RMSE: 8.80    Test MAE: 6.76    Test R²: 0.17
Test RMSE: 6.02    Test MAE: 4.59    Test R²: 0.02
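
For reference, the three test-set metrics reported here are conventionally computed as in the sketch below. The `regression_metrics` helper is hypothetical, and R² is taken as 1 − SS_res / SS_tot; the source does not say which R² definition was used (some toolkits report the squared correlation instead), so treat that choice as an assumption.

```python
import math


def regression_metrics(y_true, y_pred):
    """RMSE, MAE, and R^2 for paired observed and predicted values."""
    n = len(y_true)
    errors = [t - p for t, p in zip(y_true, y_pred)]
    ss_res = sum(e * e for e in errors)              # residual sum of squares
    mean_true = sum(y_true) / n
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return {
        "rmse": math.sqrt(ss_res / n),
        "mae": sum(abs(e) for e in errors) / n,
        # ASSUMPTION: coefficient-of-determination form of R^2.
        "r2": 1.0 - ss_res / ss_tot,
    }


# Hypothetical held-out values: three exact predictions and one miss of 2.
metrics = regression_metrics([1.0, 2.0, 3.0, 4.0], [1.0, 2.0, 3.0, 6.0])
```

On this toy input the function returns an RMSE of 1.0, an MAE of 0.5, and an R² of 0.2. RMSE is never smaller than MAE on the same predictions because squaring penalizes large errors more heavily.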