Segmenting datasets: Difference between revisions

Revision as of 14:40, 11 July 2022

Once the raw observations have been quality-controlled, then you must split the time series into shorter segments by considering:

Time and length scales of turbulence
Stationarity of the segment and Taylor's frozen turbulence hypothesis
Required statistical significance of the resulting spectra (only important if you need to remove motion-induced contamination from the spectra)

Considerations

Measurements are typically collected in the following two ways:

continuously, or in such long bursts that they can be considered continuous
short bursts that are typically at most 2-3x the expected largest turbulence time scales (e.g., 10 min in ocean environments)

This segmenting step dictates the minimum burst duration when setting up your equipment. The act of chopping a time series into smaller subsets, i.e., segments, is effectively a form of low-pass (box-car) filtering. The length of the segment in time is usually a more important consideration than detrending the time series when estimating $ε$ from the inertial subrange of the final spectra.

The shorter the segment, the higher the temporal resolution of the final $ε$ time series, and the more likely the segment will be stationary. The segment must remain sufficiently long such that the lowest wavenumber (frequencies) of the inertial subrange are retained by the computed spectra. This is particularly important when measurement noise drowns the highest wavenumber (frequencies) of the inertial subrange. Thus, using too short segments may inadvertently render the spectra unusable for deriving $ε$ from the inertial subrange by virtue of no longer resolving this subrange as shown in (Fig. 1).

Recommendations

A good rule of thumb for tidally-influenced environments is 5 to 15 min segments, but this may be shorter in certain energetic and fast-moving flows (Fig. 2) and longer in less energetic environments (Fig.3). The final segment length is partly a function of the fft-length and the desired statistical significance (degrees of freedom) of the final computed spectra.

Minimum fft-length

Fig. 1 provides a guide to the fft-length required for resolving different subrange as a function of the speed past the sensor, and $ε$ . For instance, an fft-length of 4 s would resolve one decade of the inertial subrange at speeds past the sensor of 0.5 m/s and $ε \sim 1 0^{- 7}$ W/kg. Longer segments would be required for slower flows or lower $ε$ . At $ε \approx 1 0^{- 9}$ W/kg, one decade of the inertial subrange would be resolved with an fft-length longer than 10s provided the speed was faster than 0.5 m/s.

Because the inertial subrange may be contaminated at the highest wavenumbers by instrument noise, we suggest using longer segments than the minimum shown in Fig. 1b. This strategy also enables having a larger number of spectral observations to fit over the inertial subrange given the spectral resolution also depends on the fft-length.

Choosing segment-length

The final segment length may be larger than the fft-length depending if you use block- or band-averaging for computing the spectra Refer to textbook here?. The maximum segment length should be shorter than the largest turbulent time scales.

Are the peaks in the MAVS data vortex shedding from the rings. Check the motion sensors onboard?

Fig. 3: Same as Fig 1 but for a different dataset with low speeds and low $ε$ , requiring the use of relatively long segments (1024s) to estimate the spectra from fft-length of 512 s (4096 samples @ 8 Hz).

Overlapping segments

Using overlapping segments, i.e., obtaining your first $ε$ estimate from time 0 to 5 min, and the second estimate from 2.5 to 5 min (50% overlap) essentially smoothes the final timeseries $ε$ . One advantage of using overlapping segments is that you can recover estimates before and after sudden changes in flow conditions that render one segment unusable for getting $ε$ . The use of overlapping segments is purely a matter of preference, and does not impact the quality of the final timeseries of epsilon.

Return to Preparing_quality-controlled_velocities

@@ Line 12: / Line 12: @@
 * continuously, or in such long [[Burst sampling|bursts]] that they can be considered continuous
 * short [[Burst sampling|bursts]] that are typically  at most 2-3x the expected largest [[Time and length scales of turbulence|turbulence time scales]] (e.g., 10 min in ocean environments)
-This segmenting step dictates the minimum [[Burst sampling|burst]] duration when setting up your equipment. The act of chopping a time series into smaller subsets, i.e., segments, is effectively a form of low-pass (box-car) filtering. The length of the [[Segmenting datasets|segment]] in time is usually a more important consideration than [[Detrending time series|detrending the time series]] when estimating <math>\varepsilon</math> from the [[Velocity inertial subrange model|inertial subrange]] of the final spectra.
+This segmenting step dictates the minimum [[Burst sampling|burst]] duration when setting up your equipment. The act of chopping a time series into smaller subsets, i.e., segments, is effectively a form of low-pass (box-car) filtering. The length of the [[Segmenting datasets|segment]] in time is usually a more important consideration than [[Detrending time series#detrend_ex|detrending the time series]] when estimating <math>\varepsilon</math> from the [[Velocity inertial subrange model|inertial subrange]] of the final spectra.
 The shorter the segment, the higher the temporal resolution of the final <math>\varepsilon</math> time series, and the more likely the segment will be [[Stationarity|stationary]]. The segment must remain sufficiently long such that the lowest wavenumber (frequencies) of the [[Velocity inertial subrange model|inertial subrange]] are retained by the [[Compute the spectra|computed spectra]]. This is particularly important when measurement noise drowns the highest wavenumber (frequencies) of the [[Velocity inertial subrange model|inertial subrange]]. Thus, using too short segments may inadvertently render the spectra unusable for deriving  <math>\varepsilon</math> from the [[Velocity inertial subrange model|inertial subrange]] by virtue of no longer resolving this subrange as shown in  ([[#fftlength|Fig. 1]]).

Anonymous

Search

Segmenting datasets: Difference between revisions

Namespaces

More

Page actions

Revision as of 14:40, 11 July 2022

Contents

Considerations

Recommendations

Minimum fft-length

Choosing segment-length

Overlapping segments

Navigation

Navigation

ATOMIX

Other

Wiki tools

Wiki tools

Anonymous

Search

Segmenting datasets: Difference between revisions

Revision as of 14:40, 11 July 2022

Considerations

Recommendations

Minimum fft-length

Choosing segment-length

Overlapping segments

Navigation

Wiki tools

Page tools

Categories