U.S. Department of Education

 

 

 

Policy Priority: The Department of Education is focused on ensuring that parents, students, and policymakers are able to
use its publicly available data to take meaningful action to improve outcomes.
Supporting Decision-Making for Student Sub-populations and their Families
Problem: Certain mobile or disconnected student populations entering or reentering the community could greatly benefit
from data and resources to support their wellbeing and success. Such students and families often lack information that is
necessary to distinguish between their school options, access services, and identify affordable housing near high-quality
school and in safe neighborhoods that have access to transit and employment.
Identifying Equity Scores and Gaps
Problem: Within and across school districts and communities there are significant disparities in outcomes (e.g. achievement,
graduation rate) between different student groups, in whole or in part due to inequitable access to resources (e.g. per-pupil
expenditures, rigorous coursework, effective teachers). Decision-makers and parents to would benefit from information that
could help them understand these inequities and/or identify where gaps between groups of students may exist.
Your Task
You are a data scientist hired by a non-profit organization whose mission is to increase college graduation rates for
underpriveleged populations. Through advocacy and targeted outreach programs, your organization strives to identify and
alleviate barriers to educational achievement. A
Your team is committed to developing a more data-based approach to decision making. As a prelude to future analyses,
you are requested to analyze the data to identify clusters of similar colleges and universities
A Few Tips
1. Clustering Algorithm K-means is a powerful and recommended clustering algorithm, but the dimension is very high. You
may need to use dimension reduction/feature extraction methods to make preprocess the data. At the end of the day,
the choice of clustering technique(s) is yours.
2. Data Preparation: What variables to use (you obviously don’t have to use them all)? How will you deal with missing
values? Categorical variables? Normalization or scaling? These are all very subjective questions you need to figure out
as a data scientist. In addition to being completely data driven, you may also want to look into the educational theories
related to the problem statement and technical characteristics of the algorithm(s) you’re using.
3. Is it possible to explain what each cluster represents? Did you retain or prepare a set of features that enables a
meaningful interpretation of the clusters? Do the compositions of the clusters seem to make sense?
Submission
The annotated analytical process and the reproducible code.
In your submission, you may want to include an array of cluster labels corresponding to UNITID (the unique
college/university I.D. variable). Note: Due to the presence of missing data, some observations may be ommitted prior
to clustering.
A brief explanation (interpretation) of the clusters.
You can choose to work on this assignment individually or in a team (team size <= 3). If you want to work in a larger group,
email Lukas.
Your submission should be in .html format or .pdf format .ipynb or .Rmd file will not be accepted. This will
demonstrate how you communicate your code/analysis with others who may not have access to your data. If you work
choose to work in a team, only one of the team members needs to submit the assignment.
Your work will be evaluated on three simple criteria: (a) the implementation process of the clustering, (b) the clarity of your

 

Sample Solution

ily be tracked by monitoring the global sea level heights together with the seafloor morphology and glacial ice mensuration. This is due to the fact that sooner or later the glacial meltwater will end up in the oceans and due to the sped-up melting treated in 3.3 and 3.4, the rate will increase drastically in the future. It is also useful to create models of inundation, coastal erosion and potential storm damage in order to thwart these catastrophes since they are a predictable consequence of sea-level rise (National Research Council 2010, Chuvieco 2008). 3.6 Accumulating natural disasters The last point is the accumulation of extreme events. The USA experiences more and more record high temperatures every year, congruously the record low temperatures occur fewer than ever. However extreme events also include heat waves, droughts, floods, cyclones, and wildfires. Changes in the earth’s system diversity also follow as a response to weather and climate extremes. Species that prove unable to adapt to the new circumstances will ultimately disappear or have to surrender to more successfully adapted species. An increased number of strong blusters with mounting intensity is also an indicator for these extreme events, just like frequent insect infestations. Insects are the profiteers of global changes in wind patterns and/or sea level rise, as they can be transported great distances into regions usually not inhabited by them (Dukes 2009). Global changes can also cause epidemic diseases dangerous for humans, as wind and sea can transport disease vectors communicated by insects. Diseases are especially dangerous; As a side effect they result in the attenuation of a population’s resilience and ability to counteract or even respond to climate as well as other stressors. The consequences of such extremes range from the disruption of food production and water supply to increased rates of morbidity and mortality and consequences for the physical and psychological health of human beings (IPCC 2014). There are quite a few ways to survey climatic extremes, as these can be very diverse. In order to monitor severe storms, it is possible to track the annual storm number together with maximum wind speed, geographic storm tracks, precipitation and flash floods. With regard to insect infestations and whether they were dislocated, taking the number of insect infestations, the insect type, the land cover of the infestations, the crop impacts and the historic recurrence into account is helpful. Human diseases can be evaluated by the number and type of epidemics and the impacts they had concerning the fatalities or in general the number hospitalised, the historic recurrence and the geographically affected

This question has been answered.

Get Answer
WeCreativez WhatsApp Support
Our customer support team is here to answer your questions. Ask us anything!
👋 Hi, Welcome to Compliant Papers.