Preliminary
- Due Date: Wednesday, May 10 at 10:00
- You must complete this assignment individually.
Learning Outcomes
- Gain experience creating graphs with
matplotlib
. - Gain additional experience using NumPy arrays.
Background
- The input file for this assignment is montana-counties.txt.
- The first line contains the number of desired groupings (e.g.
4
). - The second line contains the number of counties (e.g.
56
). - The remaining lines contain information about each county using comma-separated values. The first value is the name of the county (e.g. Beaverhead County) and the second value is the population of that county in 2020 (e.g. 9401). These lines are arranged in alphabetical order by county name and contain information from this site.
Assignment
- Download lab05.py, rename it according to the instructions above, and make sure you understand it.
- Take the program above and modify it by adding the missing methods such that when the program is run, it produces this text output and this graphical output.
calculate_averages
Specification
- The function takes as input how many buckets (
number_buckets
) should be used to summarize the county census data (county_populations
). - The function should return a NumPy array that contains
number_buckets
slots. - You may assume that the number of counties is evenly divisible by the number of buckets. For our specific input file, there are 56 counties and 4 buckets so each slot in the array will summarize 14 counties. For convenience, denote this number (14) as
n
. - The first slot in the array should contain the average of the
n
least populous counties. - The second slot in the array should contain the average of the next
n
least populous counties.
graph_summary
Specification
- The function takes as input the NumPy array that is returned by the
calculate_averages
function. - The function uses matplotlib to produce this graph.
Requirements and Grading
- 3 points - The
print_summary
function produces the desired output when the complete program is run. - 1 point - The
graph_summary
function creates a graph. - 1 point - The words “Computer Science, Lab 5” appear in the top gray bar.
- 1 point - The graph is entitled “Montana County Population Analysis”.
- 1 point - Blue hexagons are used to represent each point.
- 1 point - The blue hexagons are connected with a dashed, cyan line.
- 1 point - The x-axis and y-axis are labeled with the correct labels.
- 1 point - The x-axis contains marks for 1, 2, 3 and 4 (and nothing else).
Submission
Place the solution in a file named counties_[yourname].py
and submit it to the Petra Portal no later than 12:45 on Wednesday, May 10.