Quality control of ε estimates (QA2): Difference between revisions
From Atomix
Yuengdjern (talk | contribs) No edit summary |
Yuengdjern (talk | contribs) No edit summary |
||
(3 intermediate revisions by the same user not shown) | |||
Line 1: | Line 1: | ||
Quality control measures for each beam: | Quality control measures for each beam (flagged in benchmark datasets): | ||
# Data segments for which the regression coefficient a<sub>1</sub> (see [[Processing your ADCP data using structure function techniques | previous step]]) is negative yield an imaginary <math>\varepsilon</math> value, which should be rejected | # Data segments for which the regression coefficient a<sub>1</sub> (see [[Processing your ADCP data using structure function techniques | previous step]]) is negative yield an imaginary <math>\varepsilon</math> value, which should be rejected | ||
# Ensure sufficient <math> D_{ll} </math> samples were used in the regression. | # Ensure sufficient <math> D_{ll} </math> samples were used in the regression. | ||
# Use the coefficient <math>a_0</math> (the intercept of the regression) to estimate the noise of the velocity observations and compare to the expected value based on the instrument settings. If noise is too high, <math> \epsilon </math> are rejected. | # Use the coefficient <math>a_0</math> (the intercept of the regression) to estimate the noise of the velocity observations and compare to the expected value based on the instrument settings. If noise is too high, <math> \epsilon </math> are rejected. | ||
# Data segments for which the regression coefficient a<sub>0</sub> (see [[Processing your ADCP data using structure function techniques | previous step]]) is negative (implying a negative noise floor) are likely to be invalid and are typically rejected | # Data segments for which the regression coefficient a<sub>0</sub> (see [[Processing your ADCP data using structure function techniques | previous step]]) is negative (implying a negative noise floor) are likely to be invalid and are typically rejected | ||
# | # In the case of <math> \epsilon </math> estimated using the modified regression method that accounts for oscillatory motion, reject data for invalid values of <math> a_3 </math>. | ||
# A better indication of the quality of the fit is usually provided by looking at the ratio of the estimated <math>\varepsilon</math> value to that based on the 95%-ile confidence interval estimate of the a<sub>1</sub> regression coefficient e.g. reject values where the ratio exceeds a specified threshold | # A better indication of the quality of the fit is usually provided by looking at the ratio of the estimated <math>\varepsilon</math> value to that based on the 95%-ile confidence interval estimate of the a<sub>1</sub> regression coefficient e.g. reject values where the ratio exceeds a specified threshold | ||
# The goodness of fit (R<sup>2</sup>) for the regression provides a basic indication of the quality of the fit, data with low R<sup>2</sup> are typically rejected. | |||
Other measures (not flagged): | |||
# Examine the distribution of <math>\varepsilon</math> estimates - in most situations, this would be expected to be log-normal | # Examine the distribution of <math>\varepsilon</math> estimates - in most situations, this would be expected to be log-normal | ||
# Comparison of observed values with nominal values based on established boundary-forced scalings may also be informative and help to identify observation or processing issues | # Comparison of observed values with nominal values based on established boundary-forced scalings may also be informative and help to identify observation or processing issues | ||
Quality control measures for final <math> \epsilon </math> estimate: | |||
# Examine the consistency of <math>\varepsilon</math> between bins (if evaluated) and between beams as an indication of estimate reliability - the geometric mean between beams is frequently used as the representative value | |||
------ | ------ | ||
To see how the data flags are applied, go to [[Velocity Profiler data flags| Velocity Profiler Data Flags]] | |||
Latest revision as of 20:58, 3 June 2022
Quality control measures for each beam (flagged in benchmark datasets):
- Data segments for which the regression coefficient a1 (see previous step) is negative yield an imaginary [math]\displaystyle{ \varepsilon }[/math] value, which should be rejected
- Ensure sufficient [math]\displaystyle{ D_{ll} }[/math] samples were used in the regression.
- Use the coefficient [math]\displaystyle{ a_0 }[/math] (the intercept of the regression) to estimate the noise of the velocity observations and compare to the expected value based on the instrument settings. If noise is too high, [math]\displaystyle{ \epsilon }[/math] are rejected.
- Data segments for which the regression coefficient a0 (see previous step) is negative (implying a negative noise floor) are likely to be invalid and are typically rejected
- In the case of [math]\displaystyle{ \epsilon }[/math] estimated using the modified regression method that accounts for oscillatory motion, reject data for invalid values of [math]\displaystyle{ a_3 }[/math].
- A better indication of the quality of the fit is usually provided by looking at the ratio of the estimated [math]\displaystyle{ \varepsilon }[/math] value to that based on the 95%-ile confidence interval estimate of the a1 regression coefficient e.g. reject values where the ratio exceeds a specified threshold
- The goodness of fit (R2) for the regression provides a basic indication of the quality of the fit, data with low R2 are typically rejected.
Other measures (not flagged):
- Examine the distribution of [math]\displaystyle{ \varepsilon }[/math] estimates - in most situations, this would be expected to be log-normal
- Comparison of observed values with nominal values based on established boundary-forced scalings may also be informative and help to identify observation or processing issues
Quality control measures for final [math]\displaystyle{ \epsilon }[/math] estimate:
- Examine the consistency of [math]\displaystyle{ \varepsilon }[/math] between bins (if evaluated) and between beams as an indication of estimate reliability - the geometric mean between beams is frequently used as the representative value
To see how the data flags are applied, go to Velocity Profiler Data Flags
Return to ADCP Flow Chart front page