Public health (large area) epidemiology

Module Exercise 2: Public health (large area) epidemiology

The exercise:

The Australian government Department of Health (federal) produces reports each year containing data on notifiable diseases which are of great use to those studying changes in disease distributions with space or time with the aim of planning country-wide control initiatives. To facilitate similar regional operations, states and territories produce annual Public Health Bulletins, zooming-in on the data at a higher level of resolution.

Part 1:  Access a table for NSW showing disease incidence for the years 2003 to 2012, and produce labelled, computer-generated time trend graphs for giardiasis and HIV infections using an application such as Excel®.

Part 2: Briefly discuss two possible reasons why each of these diseases might have increased or decreased over this period. Reference this discussion.

Aims of the exercise:

i. To acquire skills in the extraction, presentation, analysis and use of quantitative information from a large-area epidemiological report.

ii. To develop early perspectives on risk factors for specific diseases, and insight as to how and why these might change with time.

Hints:

i. Public Health Bulletins usually include data up to the year before they were published (eg: a 2012 bulletin usually contains data up to 2011).

ii. Departments are sometimes a few years behind with their bulletins, so a bulletin for the year 2013 might not be available until 2015.

iii. For comparison of disease incidence by places or by year, rates (not absolute numbers) are always used in epidemiology. Disease notification rates are usually given per 100,000 population.

Module Exercise 3: Bivariate linear regression analysis (correlation)

Background to the exercise:

As a preliminary step in a large-scale study of asthma in Armidale, New South Wales, you are asked to carry out a study to identify the impact of ambient atmospheric general particulate pollution (PM10) on the incidence of asthmatic wheeze in primary school children. Thermal inversions can occur periodically in the Armidale basin, trapping pollutants from point and diffuse sources in the lower atmosphere. 

To ensure an accurate medical diagnosis you select all primary school children attending a day clinic over a 30-day period in April.  In this month, other “confounding” risk factors (such as rainfall) are at relatively low levels, and therefore to some extent controlled.

From trained clinical staff you obtain a daily record of asthmatic wheeze incidence in children presenting for all medical conditions at the clinic during the study period. The daily air quality record is obtained from the Department of the Environment and a short latency period (minutes to hours) between exposure to ambient air particulates and production of symptoms is assumed. You produce the tabulated data shown on the next page.

The exercise:

Part 1: Plot a graph showing the relationship between asthma wheeze and ambient atmospheric particulate matter (PM10) using a recognised computer application such as Excel®. Add a computer-generated line of best fit, assuming a linear relationship. Present the graph for assessment with a comment on the type of correlation (direct or inverse), its electronically-computed strength in terms of Pearson’s Product Moment Correlation Coefficient r  (some versions of the graph on Excel also give this), and a qualitative interpretation of this result (eg: “low correlation”, “moderate correlation”, etc.)

Part 2: Using the formula and table given in the module notes, hand-calculate Pearson’s Product Moment Correlation Coefficient, r. Submit the tabulation used to generate values for the algebraic formula, along with your calculated value for r. Comment on the possible reason for any differences noted between the result obtained in parts 1 and 2.

Aim of the Exercise:

i. To gain an understanding of the use of bivariate linear regression analysis as a fundamental but powerful epidemiological analytical tool.

ii. To gain a conceptual idea of an industrially generated, environmental risk factor for an important health condition.

DayTotal number of children with asthmatic wheeze Total number of children attending the clinic that dayAmbient atmospheric particulates (PM10 in µg/m3)Blank column for calculated values
11142040 
2823045 
31119090 
42455060 
53164350 
63971060 
739560360 
826302320 
919200110 
103158770 
112258980 
122163264 
131458550 
142760250 
1522320130 
1616245220 
1724558100 
182657060 
194260340 
203655540 
2146599100 
2217197160 
2316197190 
242652080 
252247650 
261960040 
271455730 
281748140 
291022550 
301019040 

Hints:

i. If the question looks confusing and perplexing you probably need to go back to the module notes where the approach is clearly explained, and work through an example.

ii. When finished check your calculations thoroughly as marks are awarded for both method and the correct answer. With care it is relatively easy to score 100%.

iii. The first step when working with raw data is always to classify (ie: to construct a table). When in doubt, tabulate, when masses of numbers will always become clearer.

iv. Ensure accuracy by using one more decimal place in your calculations than you intend to give in your answer.

v. Use the formula in the module notes rather than the one given in text books, which is primarily for statisticians.

vi. When comparing health states (diseases and fitness) always use rates.

vii. Excel® does not do as much as SPSS and Minitab, but is probably the most user-friendly program to use, and links well with Word®. For example, values in the Word table can be cut and pasted into Excel®. Adding the line of best fit in Excel® involves highlighting the graph first by clicking on it, when the menu tab for this function will appear.

"Looking for a Similar Assignment? Get Expert Help at an Amazing Discount!"