Acoustic Data Pipeline

Convert Praat- or VoiceSauce-style TXT files to CSV, collapse tokens into normalized time intervals, merge token labels, then clean the acoustic CSV files with within-speaker outlier removal and z-score columns.

1. TXT to CSV

Handles tab-delimited text files and removes trailing empty columns.

Download CSV

TXT file

Delimiter

Options: Tab, Auto

Choose a TXT file to convert.
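The conversion described in this step can be sketched in Python as follows. This is a minimal illustration, not the tool's own code: the function name `txt_to_csv` is hypothetical, and it assumes a "trailing empty column" means a rightmost column that is blank in every row, header included.

```python
import csv
import io


def txt_to_csv(text: str, delimiter: str = "\t") -> str:
    """Convert delimited text to CSV and drop trailing all-empty columns."""
    # Parse the text and skip completely blank lines.
    rows = [row for row in csv.reader(io.StringIO(text), delimiter=delimiter) if row]
    if not rows:
        return ""
    # Pad ragged rows so every row has the same number of cells.
    width = max(len(row) for row in rows)
    rows = [row + [""] * (width - len(row)) for row in rows]
    # Assumption: a column is "trailing empty" when it is the last column
    # and blank in every row, including the header.
    while width > 0 and all(row[width - 1].strip() == "" for row in rows):
        width -= 1
    out = io.StringIO()
    csv.writer(out, lineterminator="\n").writerows(row[:width] for row in rows)
    return out.getvalue()
```

A trailing tab at the end of each line, which some export scripts produce, would yield exactly this kind of empty final column.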

2. Make Normalized Intervals

Split each token into a custom number of equal-duration bins and average selected measures.

Download Binned CSV

CSV file for interval binning

Delimiter

Intervals per token

Token start column

Token end column

Frame time column

Token identity columns: Rows with the same values in the selected token columns are treated as one token.

Measures to average: Numeric acoustic columns are preselected.

Primary average: The primary output column is measure_mean. Comparison columns can include both versions, with and without 0 values.

Add comparison columns with and without 0 values, plus zero diagnostics

Upload a CSV or use the converted file above.
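The binning in this step can be sketched in Python as below. It is an illustration under stated assumptions rather than the tool's implementation: the function name `bin_token_frames`, the `interval` output column, and the `skip_zeros` flag are hypothetical, and a frame that falls exactly on the token end is assigned to the last interval.

```python
from collections import defaultdict
from statistics import fmean


def bin_token_frames(rows, n_bins, token_cols, start_col, end_col, time_col,
                     measures, skip_zeros=False):
    """Average measures within n_bins equal-duration intervals of each token."""
    buckets = defaultdict(lambda: defaultdict(list))
    for row in rows:
        start = float(row[start_col])
        end = float(row[end_col])
        t = float(row[time_col])
        duration = end - start
        # Skip degenerate tokens and frames that lie outside the token.
        if duration <= 0 or not (start <= t <= end):
            continue
        # Relative position in [0, 1] scaled to a 0-based interval index.
        index = min(int((t - start) / duration * n_bins), n_bins - 1)
        token = tuple(row[c] for c in token_cols)
        bucket = buckets[(token, index)]
        for m in measures:
            value = float(row[m])
            if skip_zeros and value == 0:
                continue  # hypothetical option: leave zeros out of the mean
            bucket[m].append(value)
    binned = []
    for token, index in sorted(buckets):
        bucket = buckets[(token, index)]
        record = dict(zip(token_cols, token))
        record["interval"] = index + 1  # 1-based interval number
        for m in measures:
            values = bucket.get(m, [])
            record[f"{m}_mean"] = fmean(values) if values else None
        binned.append(record)
    return binned
```

Calling the function twice, once with `skip_zeros=False` and once with `skip_zeros=True`, gives the two versions of each mean that a with/without 0 comparison would need.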

3. Merge Labels by Token and Time

Attach Full_Label by nearest seg_end within each token, then derive Voicing, Gender, and Context.

Download Labeled CSV

Data CSV to label

Data delimiter

Data seg_end column

Label CSV with Full_Label

Label delimiter

Label seg_end column

Data token column: Use the shared token/file column, usually Filename or token_id.

Label token column: Must refer to the same token identity as the data token column.

Full label column

Filename column: Gender is the first character of this filename.

Maximum time distance: Same unit as seg_end. Leave blank to always take the closest label within the token.

Upload a data CSV and a label CSV, or use converted/binned data from earlier steps.
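The nearest-time merge in this step can be sketched in Python as follows. The function name `attach_nearest_label` and its parameters are hypothetical. The sketch covers the Full_Label lookup and the Gender rule stated above; how Voicing and Context are derived from Full_Label is not specified here, so they are left out.

```python
def attach_nearest_label(data_rows, label_rows, token_col="Filename",
                         time_col="seg_end", label_col="Full_Label",
                         filename_col="Filename", max_distance=None):
    """Attach the label whose seg_end is closest to each data row, per token."""
    # Group label rows by token so matching never crosses token boundaries.
    labels_by_token = {}
    for lab in label_rows:
        labels_by_token.setdefault(lab[token_col], []).append(lab)

    merged_rows = []
    for row in data_rows:
        t = float(row[time_col])
        best, best_distance = None, None
        for lab in labels_by_token.get(row[token_col], []):
            distance = abs(float(lab[time_col]) - t)
            if best_distance is None or distance < best_distance:
                best, best_distance = lab, distance
        merged = dict(row)
        merged[label_col] = None
        # With no maximum distance, the closest label in the token always wins.
        if best is not None and (max_distance is None or best_distance <= max_distance):
            merged[label_col] = best[label_col]
        # Gender is the first character of the filename.
        merged["Gender"] = row[filename_col][:1]
        merged_rows.append(merged)
    return merged_rows
```

Rows whose token has no label, or whose closest label is farther than `max_distance`, keep an empty label instead of borrowing one from another token.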

4. Clean CSV and Create Within-Speaker z-scores

Select speaker and measure columns, then export a cleaned CSV.

Download Cleaned CSV

CSV file

CSV delimiter

Speaker source: Choose an existing speaker column, or derive speaker IDs from filenames.

Filename column

Speaker pattern: Regular expression for derived speaker IDs. The default keeps the text before the first underscore.

Measure columns: Numeric columns are preselected. Use Command-click (Ctrl-click on Windows or Linux) to adjust the selection.

Outlier threshold: Values farther than this many SDs from each speaker's mean are treated as outliers.

Removed-value output: How removed outliers and invalid zeros are written in clean columns.

Treat 0 as missing for selected measures

Remove within-speaker outliers in clean columns

Add within-speaker z-score columns

Compute z-scores after outlier removal

Upload a CSV file to configure cleaning.
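The cleaning options in this step can be sketched in Python as below, for a single measure with every option enabled. Several details are assumptions rather than the tool's documented behavior: the function names, the `_clean` and `_z` column suffixes, the use of the sample standard deviation, and writing removed values as `None`.

```python
import re
from collections import defaultdict
from statistics import mean, stdev


def speaker_from_filename(filename, pattern=r"^[^_]+"):
    """Derive a speaker ID; the default keeps text before the first underscore."""
    match = re.search(pattern, filename)
    return match.group(0) if match else filename


def _mean_sd(values):
    """Return (mean, sample SD), or (None, None) with fewer than two values."""
    if len(values) < 2:
        return None, None
    return mean(values), stdev(values)


def clean_and_zscore(rows, speaker_col, measure, threshold=3.0, zero_is_missing=True):
    """Add <measure>_clean and <measure>_z columns computed within each speaker."""
    clean_col, z_col = f"{measure}_clean", f"{measure}_z"
    cleaned, by_speaker = [], defaultdict(list)
    for row in rows:
        raw = row.get(measure)
        value = float(raw) if raw not in (None, "") else None
        if zero_is_missing and value == 0:
            value = None  # treat 0 as a missing measurement
        new = dict(row)
        new[clean_col], new[z_col] = value, None
        cleaned.append(new)
        by_speaker[row[speaker_col]].append(new)
    for group in by_speaker.values():
        # Pass 1: remove values farther than threshold SDs from the speaker mean.
        m, sd = _mean_sd([r[clean_col] for r in group if r[clean_col] is not None])
        if sd:
            for r in group:
                if r[clean_col] is not None and abs(r[clean_col] - m) > threshold * sd:
                    r[clean_col] = None
        # Pass 2: z-scores from the mean and SD of the values that remain.
        m, sd = _mean_sd([r[clean_col] for r in group if r[clean_col] is not None])
        if sd:
            for r in group:
                if r[clean_col] is not None:
                    r[z_col] = (r[clean_col] - m) / sd
    return cleaned
```

Because the z-scores are computed after outlier removal, an extreme value does not inflate the SD that the remaining values are scaled by. Note that with very few values per speaker, no value can lie many SDs from the mean, so a high threshold may remove nothing.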

5. Analysis and Visualization

Generate selectable R code for lmer/emmeans and draw quick mean +/- SE plots in the browser.

CSV file for analysis

Delimiter

Model response

Fixed effects: Selected effects are joined with * for interactions.

Random intercept

emmeans pairwise factor

p-value adjustment

R data object name
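The way the model fields above could combine into R code is sketched below in Python. The generator function `build_r_code` is hypothetical, and the exact code the tool emits may differ. The emitted R calls, `lmer()` from lme4 and `emmeans()` with a `pairwise ~ factor` specification and an `adjust` argument, are standard usage of those packages.

```python
def build_r_code(response, fixed_effects, random_intercept, pairwise_factor,
                 adjust="tukey", data_name="dat"):
    """Build an R snippet that fits a mixed model and runs pairwise comparisons."""
    # Selected fixed effects are joined with * so all interactions are included.
    fixed = " * ".join(fixed_effects) if fixed_effects else "1"
    formula = f"{response} ~ {fixed} + (1 | {random_intercept})"
    lines = [
        "library(lme4)",
        "library(emmeans)",
        f"model <- lmer({formula}, data = {data_name})",
        f'emmeans(model, pairwise ~ {pairwise_factor}, adjust = "{adjust}")',
    ]
    return "\n".join(lines)


# Example with illustrative column names.
print(build_r_code("f0_z", ["Voicing", "Gender"], "Speaker", "Voicing",
                   adjust="bonferroni"))
```

In R formula syntax, `A * B` expands to both main effects plus their interaction, which is why joining the selected effects with `*` yields a full-factorial fixed-effects structure.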

Y measure

X interval

Color/group

Facet

X axis title

Y axis title

Legend title

X label map

Facet label map

Show mean +/- SE error bars

Upload a CSV, or use binned/cleaned data from earlier steps.
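The summary behind a mean +/- SE plot can be sketched in Python as follows. The function name `mean_se_by_group` is hypothetical, and the sketch assumes the standard error is the sample standard deviation divided by the square root of the group size. Groups with a single value get no error bar.

```python
from collections import defaultdict
from math import sqrt
from statistics import mean, stdev


def mean_se_by_group(rows, y_col, group_cols):
    """Return n, mean, and standard error of y_col for each group."""
    groups = defaultdict(list)
    for row in rows:
        raw = row.get(y_col)
        if raw in (None, ""):
            continue  # skip missing values
        groups[tuple(row[c] for c in group_cols)].append(float(raw))
    summary = []
    for key in sorted(groups):
        values = groups[key]
        n = len(values)
        # Assumption: SE = sample SD / sqrt(n); undefined for a single value.
        se = stdev(values) / sqrt(n) if n > 1 else None
        record = dict(zip(group_cols, key))
        record.update({"n": n, "mean": mean(values), "se": se})
        summary.append(record)
    return summary
```

Passing the X interval, color/group, and facet columns together as `group_cols` gives one plotted point per combination.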