Download - You have data! What ’ s next? Data Analysis, Your Research Questions, and Proposal Writing Zoo 511
![Page 1: You have data! What ’ s next? Data Analysis, Your Research Questions, and Proposal Writing Zoo 511](https://reader035.vdocument.in/reader035/viewer/2022062423/56814796550346895db4cae6/html5/thumbnails/1.jpg)
You have data! What’s next?
Data Analysis, Your Research Questions, and Proposal Writing
Zoo 511
Spring 2014
![Page 2: You have data! What ’ s next? Data Analysis, Your Research Questions, and Proposal Writing Zoo 511](https://reader035.vdocument.in/reader035/viewer/2022062423/56814796550346895db4cae6/html5/thumbnails/2.jpg)
Part 1: Research Questions
![Page 3: You have data! What ’ s next? Data Analysis, Your Research Questions, and Proposal Writing Zoo 511](https://reader035.vdocument.in/reader035/viewer/2022062423/56814796550346895db4cae6/html5/thumbnails/3.jpg)
Part 1: Research Questions
Write down > 2 things you thought were interesting or engaging during the field trip
(can be a species, a habitat feature, a relationship, etc). You can phrase these as
questions, but you don’t have to yet.
![Page 4: You have data! What ’ s next? Data Analysis, Your Research Questions, and Proposal Writing Zoo 511](https://reader035.vdocument.in/reader035/viewer/2022062423/56814796550346895db4cae6/html5/thumbnails/4.jpg)
Part 1: Research Questions
What makes a good question?
![Page 5: You have data! What ’ s next? Data Analysis, Your Research Questions, and Proposal Writing Zoo 511](https://reader035.vdocument.in/reader035/viewer/2022062423/56814796550346895db4cae6/html5/thumbnails/5.jpg)
Your questions should be specific and answerable
Does sculpin CPUE differ among geomorphic units?
Is brown trout density related to flow velocity?
In what kind of stream are brown trout most
likely to be found?
What habitat do fish prefer?
NOT SO USEFUL USEFUL
![Page 6: You have data! What ’ s next? Data Analysis, Your Research Questions, and Proposal Writing Zoo 511](https://reader035.vdocument.in/reader035/viewer/2022062423/56814796550346895db4cae6/html5/thumbnails/6.jpg)
Current Velocity (m/s)B
row
n T
rout
/m2
0
1
2
3
4
5
6
Scu
lpin
per
min
ute
POOLRUNRIFFLE
Scu
lpin
CP
UE
…and statistically testable
Does sculpin CPUE differ among geomorphic units?
Is brown trout density related to flow velocity?
![Page 7: You have data! What ’ s next? Data Analysis, Your Research Questions, and Proposal Writing Zoo 511](https://reader035.vdocument.in/reader035/viewer/2022062423/56814796550346895db4cae6/html5/thumbnails/7.jpg)
Part 2: Statistics
How do we find the answer to our question?
![Page 8: You have data! What ’ s next? Data Analysis, Your Research Questions, and Proposal Writing Zoo 511](https://reader035.vdocument.in/reader035/viewer/2022062423/56814796550346895db4cae6/html5/thumbnails/8.jpg)
Why use statistics?Are there more green sunfish in pools or
runs?
Run541
Pool273
12 10•Statistics help us find patterns in the face of variation, and draw inferences beyond our sample sites•Statistics help us tell our story; they are not the story in themselves!
??
![Page 9: You have data! What ’ s next? Data Analysis, Your Research Questions, and Proposal Writing Zoo 511](https://reader035.vdocument.in/reader035/viewer/2022062423/56814796550346895db4cae6/html5/thumbnails/9.jpg)
Statistics Vocab
(take notes on your worksheet)
Categorical Variable: Discrete groups, such as Type of Reach (Riffle, Run, Pool)
Continuous Variable: Measurements along a continuum, such as Flow Velocity
What type of variable is “Mottled Sculpin /meter2”?
What type of variable is “Substrate Type”?
![Page 10: You have data! What ’ s next? Data Analysis, Your Research Questions, and Proposal Writing Zoo 511](https://reader035.vdocument.in/reader035/viewer/2022062423/56814796550346895db4cae6/html5/thumbnails/10.jpg)
Explanatory/Predictor Variable: Independent variable. On x-axis. The variable you use to predict another variable.
Response Variable: Dependent variable. On y-axis. The variable that is hypothesized to depend on/be predicted by the explanatory variable.
Statistics Vocab
![Page 11: You have data! What ’ s next? Data Analysis, Your Research Questions, and Proposal Writing Zoo 511](https://reader035.vdocument.in/reader035/viewer/2022062423/56814796550346895db4cae6/html5/thumbnails/11.jpg)
Mean: The most likely value of a random variable or set of observations if data are normally distributed (the average)
Variance: A measure of how far the observed values differ from the expected variables (Standard deviation is the square root of variance).
Normal distribution: a symmetrical probability distribution described by a mean and variance. An assumption of many standard statistical tests.
N~(μ1,σ1)N~(μ1,σ2) N~(μ2,σ2)
Statistics Vocab
![Page 12: You have data! What ’ s next? Data Analysis, Your Research Questions, and Proposal Writing Zoo 511](https://reader035.vdocument.in/reader035/viewer/2022062423/56814796550346895db4cae6/html5/thumbnails/12.jpg)
Hypothesis Testing: In statistics, we are always testing a Null Hypothesis (Ho) against an alternate hypothesis (Ha).
p-value: The probability of observing our data or more extreme data assuming the null hypothesis is correct
Statistical Significance: We reject the null hypothesis if the p-value is below a set value (α), usually 0.05.
Statistics Vocab
![Page 13: You have data! What ’ s next? Data Analysis, Your Research Questions, and Proposal Writing Zoo 511](https://reader035.vdocument.in/reader035/viewer/2022062423/56814796550346895db4cae6/html5/thumbnails/13.jpg)
What test do you need?
For our data, the response variable will probably be continuous.
T-test: A categorical explanatory variable with only 2 options.
ANOVA: A categorical explanatory variable with >2 options.
Regression: A continuous explanatory variable
![Page 14: You have data! What ’ s next? Data Analysis, Your Research Questions, and Proposal Writing Zoo 511](https://reader035.vdocument.in/reader035/viewer/2022062423/56814796550346895db4cae6/html5/thumbnails/14.jpg)
Tests the statistical significance of the difference between means from two independent samples
Student’s T-Test
Null hypothesis: No difference between means.
![Page 15: You have data! What ’ s next? Data Analysis, Your Research Questions, and Proposal Writing Zoo 511](https://reader035.vdocument.in/reader035/viewer/2022062423/56814796550346895db4cae6/html5/thumbnails/15.jpg)
Cross Plains Salmo Pond
Mottled Sculpin/m2
Compares the means of 2 samples of a categorical variable
p = 0.09
![Page 16: You have data! What ’ s next? Data Analysis, Your Research Questions, and Proposal Writing Zoo 511](https://reader035.vdocument.in/reader035/viewer/2022062423/56814796550346895db4cae6/html5/thumbnails/16.jpg)
Analysis of Variance (ANOVA)
Tests the statistical significance of the difference between means from two or more independent groups
Riffle Pool Run
Mot
tled
Scu
lpin
/m2
Null hypothesis: No difference between means
p = 0.03
![Page 17: You have data! What ’ s next? Data Analysis, Your Research Questions, and Proposal Writing Zoo 511](https://reader035.vdocument.in/reader035/viewer/2022062423/56814796550346895db4cae6/html5/thumbnails/17.jpg)
Precautions and Limitations
• Meet Assumptions
•Samples are independent
• Assumed equal variance (this assumption can be relaxed)
Variance not equal
sculpin density in pools sculpin density in runs
![Page 18: You have data! What ’ s next? Data Analysis, Your Research Questions, and Proposal Writing Zoo 511](https://reader035.vdocument.in/reader035/viewer/2022062423/56814796550346895db4cae6/html5/thumbnails/18.jpg)
Precautions and Limitations
• Meet Assumptions
•Samples are independent
• Assumed equal variance (this assumption can be relaxed)
• Observations from data with a normal distribution (test with histogram)
![Page 19: You have data! What ’ s next? Data Analysis, Your Research Questions, and Proposal Writing Zoo 511](https://reader035.vdocument.in/reader035/viewer/2022062423/56814796550346895db4cae6/html5/thumbnails/19.jpg)
Precautions and Limitations
• Meet Assumptions
•Samples are independent
• Assumed equal variance (this assumption can be relaxed)
• Observations from data with a normal distribution (test with histogram)
• No other sample biases
![Page 20: You have data! What ’ s next? Data Analysis, Your Research Questions, and Proposal Writing Zoo 511](https://reader035.vdocument.in/reader035/viewer/2022062423/56814796550346895db4cae6/html5/thumbnails/20.jpg)
Simple Linear Regression
• Analyzes relationship between two continuous variables: predictor and response
•Null hypothesis: there is no relationship (slope=0)
![Page 21: You have data! What ’ s next? Data Analysis, Your Research Questions, and Proposal Writing Zoo 511](https://reader035.vdocument.in/reader035/viewer/2022062423/56814796550346895db4cae6/html5/thumbnails/21.jpg)
Residuals
Least squared line (regression line: y=mx+b)
![Page 22: You have data! What ’ s next? Data Analysis, Your Research Questions, and Proposal Writing Zoo 511](https://reader035.vdocument.in/reader035/viewer/2022062423/56814796550346895db4cae6/html5/thumbnails/22.jpg)
Residuals
Residuals are the distances from observed points to the best-fit line
Residuals always sum to zero
Regression chooses the best-fit line to minimize the sum of square-residuals. It is called the Least Squares Line.
![Page 23: You have data! What ’ s next? Data Analysis, Your Research Questions, and Proposal Writing Zoo 511](https://reader035.vdocument.in/reader035/viewer/2022062423/56814796550346895db4cae6/html5/thumbnails/23.jpg)
Precautions and Limitations
• Meet Assumptions
• Relationship is linear (not exponential, quadratic, etc)
• X is measured without error
• Y values are measured independently
• Normal distribution of residuals
![Page 24: You have data! What ’ s next? Data Analysis, Your Research Questions, and Proposal Writing Zoo 511](https://reader035.vdocument.in/reader035/viewer/2022062423/56814796550346895db4cae6/html5/thumbnails/24.jpg)
Have we violated any assumptions?
![Page 25: You have data! What ’ s next? Data Analysis, Your Research Questions, and Proposal Writing Zoo 511](https://reader035.vdocument.in/reader035/viewer/2022062423/56814796550346895db4cae6/html5/thumbnails/25.jpg)
Residual Plots Can Help Test Assumptions
0
“Normal” Scatter
0Fan Shape: Unequal Variance
0
Curve (linearity)
![Page 26: You have data! What ’ s next? Data Analysis, Your Research Questions, and Proposal Writing Zoo 511](https://reader035.vdocument.in/reader035/viewer/2022062423/56814796550346895db4cae6/html5/thumbnails/26.jpg)
if assumptions are violated
• Try transforming data (log transformation, square root transformation)
• Most of these tests are robust to violations of assumptions of normality and equal variance (only be concerned if obvious problems exist)
• Diagnostics (residual plots, histograms) should NOT be reported in your paper. Stating that assumptions were tested is sufficient.
![Page 27: You have data! What ’ s next? Data Analysis, Your Research Questions, and Proposal Writing Zoo 511](https://reader035.vdocument.in/reader035/viewer/2022062423/56814796550346895db4cae6/html5/thumbnails/27.jpg)
Precautions and Limitations
• Meet Assumptions
• Relationship is linear (not exponential, quadratic, etc)
• X is measured without error
• Y values are measured independently
• Normal distribution of residuals
•Interpret the p-value and R-squared value
![Page 28: You have data! What ’ s next? Data Analysis, Your Research Questions, and Proposal Writing Zoo 511](https://reader035.vdocument.in/reader035/viewer/2022062423/56814796550346895db4cae6/html5/thumbnails/28.jpg)
Residuals
![Page 29: You have data! What ’ s next? Data Analysis, Your Research Questions, and Proposal Writing Zoo 511](https://reader035.vdocument.in/reader035/viewer/2022062423/56814796550346895db4cae6/html5/thumbnails/29.jpg)
P-value: probability of observing your data (or more extreme data) if no relationship existed
- Indicates the strength of the relationship, tells you if your slope (i.e. relationship) is non-zero (i.e. real)
R-Squared: indicates how much variance in the response variable is explained by the explanatory variable
-Does not indicate significance
![Page 30: You have data! What ’ s next? Data Analysis, Your Research Questions, and Proposal Writing Zoo 511](https://reader035.vdocument.in/reader035/viewer/2022062423/56814796550346895db4cae6/html5/thumbnails/30.jpg)
R-Squared and P-value
High R-Squared
Low p-value (significant relationship)
![Page 31: You have data! What ’ s next? Data Analysis, Your Research Questions, and Proposal Writing Zoo 511](https://reader035.vdocument.in/reader035/viewer/2022062423/56814796550346895db4cae6/html5/thumbnails/31.jpg)
R-Squared and P-value
Low R-Squared
Low p-value (significant relationship)
![Page 32: You have data! What ’ s next? Data Analysis, Your Research Questions, and Proposal Writing Zoo 511](https://reader035.vdocument.in/reader035/viewer/2022062423/56814796550346895db4cae6/html5/thumbnails/32.jpg)
R-Squared and P-value
High R-Squared
High p-value (NO significant relationship)
![Page 33: You have data! What ’ s next? Data Analysis, Your Research Questions, and Proposal Writing Zoo 511](https://reader035.vdocument.in/reader035/viewer/2022062423/56814796550346895db4cae6/html5/thumbnails/33.jpg)
R-Squared and P-value
Low R-Squared
High p-value (No significant relationship)
![Page 34: You have data! What ’ s next? Data Analysis, Your Research Questions, and Proposal Writing Zoo 511](https://reader035.vdocument.in/reader035/viewer/2022062423/56814796550346895db4cae6/html5/thumbnails/34.jpg)
We just talked about:
• Types of variables• 3 statistical tests: t-test, ANOVA, linear
regression• When to use these tests• How to interpret the test statistics• How to be sure you’re meeting assumptions
of the tests
![Page 35: You have data! What ’ s next? Data Analysis, Your Research Questions, and Proposal Writing Zoo 511](https://reader035.vdocument.in/reader035/viewer/2022062423/56814796550346895db4cae6/html5/thumbnails/35.jpg)
Part 3: Proposal
![Page 36: You have data! What ’ s next? Data Analysis, Your Research Questions, and Proposal Writing Zoo 511](https://reader035.vdocument.in/reader035/viewer/2022062423/56814796550346895db4cae6/html5/thumbnails/36.jpg)
Writing a Proposal
• What is the function of a proposal?– To get money
![Page 37: You have data! What ’ s next? Data Analysis, Your Research Questions, and Proposal Writing Zoo 511](https://reader035.vdocument.in/reader035/viewer/2022062423/56814796550346895db4cae6/html5/thumbnails/37.jpg)
Writing a Proposal
• What is the function of a proposal?• What information should go in a proposal?
– Research goals/objectives/hypotheses/questions– Why does this matter? (Rationale)– Procedure / Methods– Future directions / implications– Budget/cost analysis– Expected results
![Page 38: You have data! What ’ s next? Data Analysis, Your Research Questions, and Proposal Writing Zoo 511](https://reader035.vdocument.in/reader035/viewer/2022062423/56814796550346895db4cae6/html5/thumbnails/38.jpg)
Other data you can use
Previous years’ data on website: all of the same information was collected from the same place, around the same time of year. Replication!
USGS: http://waterdata.usgs.gov/nwis/uv?05435943
Background info: from the Upper Sugar River Watershed Association
Think about these data sources as you generate your questions.