(research methods)(validity)(reliability)(Validity vs. Reliability)(experimental design) (Internal threats) (external threats)

Research Methods: (top)
| The higher number indicates the greater the emphasis . . . | Description |
Explanation |
Validation |
| Qualitative methods | 5 |
5 |
1 |
| theory building | 5 |
5 |
1 |
| classification schemes and theory building (same ass model building except used to describe phenomena and events such as classifying new plants) | 4 |
3 |
2 |
| Survey methods | 4 |
3 |
2 |
| Historical research | 4 |
3 |
2 |
| evaluation study | 4 |
3 |
2 |
| single-subject studies | 4 |
3 |
2 |
| longitudinal studies | 4 |
3 |
3 |
| meta-analysis | |||
| model building (such as supply and demand models used in prediction studies) | 1 |
3 |
5 |
| experimental methods | 1 |
3 |
5 |
Links to previous studies and explanations, rationale, theory, point of view:
Observation and measurements: (top)
Kind of evidence of Validity |
Purpose |
Question Posed |
How Obtained |
| Construct validity - evidence to show that the measure behaves as we would expect it to represent the concept | To show that the test is a valid measure of the construct | Does the test behave as would be expected of a valid test of the construct? | Determine how the test would behave if valid and demonstrate that it so behaves (correlate with other measures on the construct; show an internal structure as predicted from the construct) |
Criterion-related
validity:
|
To demonstrate how the test would work when used to predict success or failure. | Does the test predict success and failure of entrants? | Correlate test scores obtained at or prior to entrance to a program with a criterion measure of success obtained at end of program. |
|
To estimate predictive validity | Does the test predict success and failure of survivors? | Correlate test scores obtained at the same time as the criterion measures of success is obtained |
| Content Validity - competency or domain validity, represents the content of an area | To show how well the test covers the domain of a subject it is designed to test | Does the test sample the content and behaviors of its domain with adequate coverage and proper emphasis? | Compare test items with a table of specifications that describe the intersections of the content covered and the behaviors taught and indicates the relative emphasis of each intersection |
| Face validity - measure looks as though it would be valid | To show that the test meets our expectations of a valid test, especially important for a test used for public policy purposes with laypersons untutored in measurement | Does the test look like a valid measure? | Examine the test, usually by an expert in the field being measured. |
(top)
Question posed |
Area of consistency of responses |
Purpose |
How estimated |
|
| Internal Consistency consistency of a test in measuring whatever it measures | Are all items measuring the same thing a single construct or dimensions? Is the score Interpretable? | Items of varying difficulty and content | To determine the homogeneity of the test and the interpretability of scores | Use random split halves or the Kuder-Richardson or Cronbachs alpha for tests with weighted responses |
| Equivalence - consistency of scores over two different samples of test items, such as two forms of the same test | Are different forms of the test equivalent? | Different forms of a test | To determine whether the forms can be used interchangeably | Correlate the scores of form A with those of form B |
| Stability - how consistent are the measures over time | Is the behavior stable over time? | Different testing times (intervals should approximate that of intended use) | To determine the stability of the construct over time | Correlate the scores at one testing with those of the second testing |
| Stability and Equivalence interaction of different tests taken as different times | Will one form of the test produce the same results as another form at a later time? | Different forms of a test given at different times | To determine the stability of the construct over time and across test forms | Correlate the score at one testing with those on the other form at a later testing |
Can you have a test that is reliable and not validity OR valid and not reliability?(top)
| Reliable | Yes | Yes | No |
| Valid | No | Yes | No |
Experimental Design and Internal and External Threats to Validity:(top)
Internal threats to validity (Krathwohl, p 474-475)
The Threats to Internal Validity chart below indicates the following:(top)
Research design: X=treatment, O=observation/data collection, C=control group, R=randomization
| Research design | History | Maturation | Testing | Instrum | Regression | Selection | Mortaility | select X mature | test X treatment | selection X treatment |
| XO | - |
- |
- |
- |
||||||
| OXO | - |
- |
- |
- |
? |
+ |
+ |
- |
- |
- |
| OXO OCO |
+ |
+ |
+ |
+ |
? |
+ |
+ |
- |
- |
? |
| R OXO ...OCO |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
- |
? |
| R XO ...CO |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
? |
| ...OXO ...OCO R XO ...CO |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
+ |
? |
| 000X00 | - |
+ |
+ |
? |
+ |
+ |
+ |
- |
+ |
+ |
Interpreting the chart:
The symbols mean:
" - " this design does not control for this threat;
"+" this design controls for this threat;
"?" design may or may not control for the threat
For example . . .
The pre-test, post-test, treatment-control group is threatened by selection-maturity interaction and testing-treatment interaction (most likely because of the lack of randomization); also, depending on the treatment it may or may not control for regression and selection-treatment interaction. By collecting observation before the treatment from the treatment and control groups it accounts for history and maturation threats; by having a treatment and control group it accounts for threats based on testing, regression (potentially), selection, and mortality.
On the other hand, the randomized pre-test, post-test, treatment-control group is only threatened by the testing-treatment interaction and perhaps by the selection-treatment interaction.
External threats to validity(top)
Obtrusiveness:
Addressing external, obtrusive threats:
Researcher Expectancy Threats:
Addressing external, researcher expectancy threats:
(top)