Preliminary

  • Due Date: Wednesday, May 10 at 10:00
  • You must complete this assignment individually.

Learning Outcomes

  • Gain experience creating graphs with matplotlib.
  • Gain additional experience using NumPy arrays.

Background

  • The input file for this assignment is montana-counties.txt.
  • The first line contains the number of desired groupings (e.g. 4).
  • The second line contains the number of counties (e.g. 56).
  • The remaining lines contain information about each county using comma-separated values. The first value is the name of the county (e.g. Beaverhead County) and the second value is the population of that county in 2020 (e.g. 9401). These lines are arranged in alphabetical order by county name and contain information from this site.

Assignment

  • Download lab05.py, rename it according to the instructions above, and make sure you understand it.
  • Take the program above and modify it by adding the missing methods such that when the program is run, it produces this text output and this graphical output.

calculate_averages Specification

  • The function takes as input how many buckets (number_buckets) should be used to summarize the county census data (county_populations).
  • The function should return a NumPy array that contains number_buckets slots.
  • You may assume that the number of counties is evenly divisible by the number of buckets. For our specific input file, there are 56 counties and 4 buckets so each slot in the array will summarize 14 counties. For convenience, denote this number (14) as n.
  • The first slot in the array should contain the average of the n least populous counties.
  • The second slot in the array should contain the average of the next n least populous counties.

graph_summary Specification

  • The function takes as input the NumPy array that is returned by the calculate_averages function.
  • The function uses matplotlib to produce this graph.

Requirements and Grading

  • 3 points - The print_summary function produces the desired output when the complete program is run.
  • 1 point - The graph_summary function creates a graph.
  • 1 point - The words “Computer Science, Lab 5” appear in the top gray bar.
  • 1 point - The graph is entitled “Montana County Population Analysis”.
  • 1 point - Blue hexagons are used to represent each point.
  • 1 point - The blue hexagons are connected with a dashed, cyan line.
  • 1 point - The x-axis and y-axis are labeled with the correct labels.
  • 1 point - The x-axis contains marks for 1, 2, 3 and 4 (and nothing else).

Submission

Place the solution in a file named counties_[yourname].py and submit it to the Petra Portal no later than 12:45 on Wednesday, May 10.