Unable to show unicode from excel files

juanmarinlahoz · December 13, 2016, 11:40am

I have written an experiment in which the subject has to choose between to unicode characters. The content of the text comes from an excel condition file. It works perfectly on Windows 7 and 8.1 but when I try to run it in a laptop with Windows 10 all the unicode characters are shown as squares. First time I tried using the latest psychopy version but then I downgraded to 1.83.04 because that is the one running on the 8.1 computer.
The script starts with

#!/usr/bin/env python
# -*- coding: utf-8 -*-

The encoding of the excel files have not changed.

Any ideas why the code does not work in the new laptop?

Michael · December 13, 2016, 8:39pm

What happens with a .csv format conditions file?
Try uploading a minimal example here so the problem can be reproduced.

juanmarinlahoz · December 15, 2016, 9:30am

If I save a unicode tex file in Excel instead of xlsx (I use unicode text because CSV option turns unicode into “?”) I got some errors:

################## Running: E:\jap_trial\jap_trial_lastrun.py ##################
pyo version 0.6.6 (uses single precision)
Traceback (most recent call last):
  File "E:\jap_trial\jap_trial_lastrun.py", line 76, in <module>
    trialList=data.importConditions(u'jap_loop.csv'),
  File "C:\Program Files (x86)\PsychoPy2\lib\site-packages\psychopy-1.83.04-py2.7.egg\psychopy\data.py", line 1980, in importConditions
    trialsArr = read_csv(fileUniv, encoding='utf-8')  # use pandas reader, which can handle commas in fields, etc
  File "C:\Program Files (x86)\PsychoPy2\lib\site-packages\pandas\io\parsers.py", line 474, in parser_f
    return _read(filepath_or_buffer, kwds)
  File "C:\Program Files (x86)\PsychoPy2\lib\site-packages\pandas\io\parsers.py", line 250, in _read
    parser = TextFileReader(filepath_or_buffer, **kwds)
  File "C:\Program Files (x86)\PsychoPy2\lib\site-packages\pandas\io\parsers.py", line 566, in __init__
    self._make_engine(self.engine)
  File "C:\Program Files (x86)\PsychoPy2\lib\site-packages\pandas\io\parsers.py", line 705, in _make_engine
    self._engine = CParserWrapper(self.f, **self.options)
  File "C:\Program Files (x86)\PsychoPy2\lib\site-packages\pandas\io\parsers.py", line 1072, in __init__
    self._reader = _parser.TextReader(src, **kwds)
  File "pandas\parser.pyx", line 509, in pandas.parser.TextReader.__cinit__ (pandas\parser.c:4748)
  File "pandas\parser.pyx", line 663, in pandas.parser.TextReader._get_header (pandas\parser.c:6603)
UnicodeDecodeError: 'utf8' codec can't decode byte 0xff in position 0: invalid start byte

Previusly when using the xlsx I got no errors but all the letters were squares.

I have made in the builder a simplified version with the same problem (runs on my Win 8.1 laptop but not in de Win 10):
jap_trial_lastrun.py (8.4 KB)
jap_trial.psyexp (5.8 KB)

I attach also the conditions
jap_loop.csv (30 Bytes)
jap_loop.xlsx (8.7 KB)

Thank you

richard · December 15, 2016, 9:54am

The Excel file itself is fine, and pandas 0.18.1 loads it flawlessly for me in the Jupyter Notebook. In my terminal window I only get ?'s, though, so this appears to be a problem with the fonts used.

richard · December 15, 2016, 9:56am

This shouldn’t be related to your problem.

juanmarinlahoz · December 15, 2016, 11:37am

I wrote it just in case some one suspected that was the problem

juanmarinlahoz · December 15, 2016, 11:50am

That actually helped!

I do not understand exactly why but it works now.

I installed Korean and Japanese fonts. I choosed Meiryo for the stimuli and worked. This is strange for me because under Excel all the strings are corretly shown in Arial but not in psychopy using the same font.

Now the instructions are shown in Arial and the stimuli in Meiryo (Meiryo also work for latin instructions but they do not fit in the original place)

Thank you

richard · December 15, 2016, 11:56am

Awesome. Don’t know why PsychoPy cannot use Arial, though…

jon · December 15, 2016, 5:05pm

My guess is that Arial font doesn’t really include korean or japanese characters. I imagine Excel detects that and switches (silently) to an alternative when it sees your characters. PsychoPy isn’t that smart!

jon · December 15, 2016, 5:08pm

Actually from this link I’ll change my theory:
http://www.wazu.jp/gallery/views/View_ArialUnicodeMS.html

Maybe there’s a version of Arial that supports kanji but that’s a different font file from the one psychopy is using.

juanmarinlahoz · December 16, 2016, 2:55pm

I do not really know. The thing is under linux (Mint 13 and Mint 14) and Windows 7 and 8.1 Arial works fine both with kanji and Korean. But using the same version of psychopy under Windows 10 it does not. It would be good to know why to avoid possible upcoming issues. On the other hand, I am quite happy because today I used the task on a real participant and everything was as expected.

richard · December 16, 2016, 5:28pm

I have created an issue:
https://github.com/psychopy/psychopy/issues/1324

richard · December 16, 2016, 5:34pm

I wonder if Windows 10 ships with a different Arial font

saluwe · November 8, 2017, 9:29am

I am having unfortunately the same problem, but up to now, none of the suggested solutions worked for me.
My experiment uses .xlsx condition files. Changing them to .csv files does not work. The error I get is "UnicodeDecodeError: ‘utf8’ codec … (see picture attached).

The experiment runs fine on a laptop with windows 7 in version 1.83.04. But I am using a new configured computer (also Windows 7) and the PsychoPy Version 1.85.3.

Any ideas and suggestions why I get this error?

daniel.riggs1 · November 8, 2017, 2:12pm

This doesn’t seem to be the same issue at all. I would start a new post and include more information, there’s not much here to go on.

Michael · November 8, 2017, 8:26pm

@daniel.riggs1 is right, your problem is actually likely related to this thread:

Topic		Replies	Views
Able to encode utf-16 or something in builder view? Builder	10	2483	February 27, 2020
Messy data on Excel Builder	7	4113	June 25, 2018
Unicode DeCode Error 'utf8' codec and audio driver Builder unicode , sound	2	2195	November 14, 2017
# -- coding: utf-8 -- ruins my code Coding	0	1350	October 4, 2019
Conditions error_No parameters set Builder	6	1371	November 16, 2017

Unable to show unicode from excel files

Related topics