Thursday, February 2, 2017

UnicodeDecodeError when reading CSV file in Pandas with Python 3+

Am getting the error "UnicodeDecodeError: 'utf-8' codec can't decode byte 0xba in position 10: invalid start byte "while running the following code
import pandas as pd
dataFile='~/BX-Book-Ratings.csv'
data=pd.read_csv(dataFile,sep=";",header=0,names=["user","isbn","rating"])
Solution:
df = pd.read_csv(dataFile,sep=";",header=0,encoding = "ISO-8859-1",names=["user","isbn","rating"])

No comments: