Recreate csv data files from log.gz?

goppenh2 · February 10, 2023, 6:05pm

Background:
I was having trouble syncing updates for a known-working Pavlovia experiment across multiple computers (laptop + desktop, synced via Google Drive; syncing with Pavlovia servers repeatedly generated gitlab 404 errors), so recently tried installing the latest version of Psychopy (2023.1.0, updating from 2022.2.3) in the hopes that it would fix the problem. In the process, the “Is trials?” checkbox for the main loop of my experiment somehow became unchecked; I suspect that might result from a compatibility problem with the new version of Psychopy, though I can’t completely rule out user error.

Description of the problem:
The situation now is that 8 participants have completed the experiment using the new version of this script. The csv file only saved data for the last of its 500 trials (because “Is trials?” somehow became disabled for the main loop), but the log.gz file that was saved along with it shows data from all of the trials. Is there a script available that could use the log.gz files to recreate the csv files for these participants?

Thanks!

goppenh2 · February 16, 2023, 8:10pm

I ended up writing some code myself to parse the log files and merge them with the corresponding conditions files and truncated csv files. It would have been useful to if Psychopav had some script that automatically parsed the log files, though.

wakecarter · February 16, 2023, 11:05pm

Hi. Would you be happy to upload and share your solution, or is it too specific to your experiment?

goppenh2 · February 17, 2023, 3:43pm

I think it’s too specific to my application to be of much use to anyone who couldn’t program their own script from scratch. But basically what I did (in R) was:

Create a list of csv files in my data directory and read each file, one at a time.
If the current csv file has too few trials, salvage the info that is there and then read in the *.log.gz file with the same name.
With the log file read in and parsed as a tsv, it becomes a dataframe with 3 columns and n rows.
Skip the first n rows until you get to the first row that looks like it represents trial level data; I read through a log file to identify some identifiers for my expt, but it would be different for other expts.
Initialize a dataframe with n rows, trialData, and initialize a currentrow counter at zero.
Figure out how to identify the start/end of each trial.
At the start of each trial, increment currentrow, and create a vector, identifying, parsing, and saving relevant events and info in that trial to the vector.
At the end of the trial, save the vector as currentrow in the trialData df.
After parsing all trials, trialData should have one row with entries for each trial in your experiment, and if you did it right it should have a fixed number of entries in each row.
Assuming that you used a csv to specify trial conditions, you can read that in and join or cbind it to trialData. If, like me, your csv specified a fixed sequence and you didn’t use psychopy to do any further randomization, then you can basically just tape it to your recovered trial data; if you did further randomisation in psychopy, then you’d need to figure out a way of unambiguously identifying which line from your conditions file goes with which recovered trial, and depending on your experiment and script that might not be possible.
Figure out how to verify that nothing went awry, and fix any problems.
Combine the trial data with whatever you were able to salvage from the csv file.

Topic		Replies	Views
Pavlovia randomly switching to gz files instead of csv Online experiments	8	634	March 15, 2022
Data output suddenly stopped being saved as csv! Online experiments	12	727	October 16, 2024
CSV files are empty! Online experiments data	10	996	September 26, 2019
Incomplete Log Files in Pavlovia Online experiments logfiles	4	1413	September 20, 2019
Pavlovia: (csv) data files are not being saved Online experiments pavlovia	22	3805	October 2, 2023

Recreate csv data files from log.gz?

Related topics