Not reading my dataset

Roopa1i_ma1hotra · July 9, 2020, 6:07pm

df=pd.read_csv(r"C:\Users\HP\ml-100k\u.data",sep=’\t’)

it is showing error “file not found” but i have already downloaded the file and it is present at (C:\Users\HP) this file path

Roopa1i_ma1hotra · July 9, 2020, 6:47pm

i have downloaded the zip file ml-100k.zip as directed in first video of movie recommendation project.

Aayushkh_333 · July 9, 2020, 6:48pm

Try removing the r before the path

Roopa1i_ma1hotra · July 9, 2020, 6:49pm

df=pd.read_csv(“C:\Users\HP\Desktop\ml-100k\u.data”,sep=’\t’)
^
SyntaxError: (unicode error) ‘unicodeescape’ codec can’t decode bytes in position 2-3: truncated \UXXXXXXXX escape

now it is showing this error

Aayushkh_333 · July 9, 2020, 6:50pm

Hey @Roopa1i_ma1hotra, do one thing. Go to u.data file in your laptop. Right click on the file and see it’s location or path…where it is originally stored. Copy that location in pd.read_csv( ) function. There must be an error in the path itself .

Roopa1i_ma1hotra · July 9, 2020, 6:55pm

there is no option to look for the location of u.data file

Aayushkh_333 · July 9, 2020, 7:03pm

In the properties option maybe ?

Roopa1i_ma1hotra · July 9, 2020, 7:04pm

thankyou sir. it’s done

Aayushkh_333 · July 9, 2020, 7:07pm

I hope you were able to understand the error !

Please mark the doubt as resolved in your doubts section
Happy Learning

ayush_bhardwaj5588 · October 13, 2020, 2:18am

df = pd.read_csv(‘C:\Users\Ayush Bhardwaj\Downloads\ml-100k\ml-100k’ , sep = “\t”)
getting the same . please help

lingmaaki · February 27, 2023, 5:22am

The error message you’re seeing indicates that Python encountered a string with a Unicode escape sequence that is truncated, meaning it doesn’t have the required number of hexadecimal digits. The position 2-3 in the error message refers to the location in the string where the error occurred.

You can solve this error “codec can’t decode bytes” by using a raw string by prefixing your string with the letter “r”. Raw strings don’t interpret backslashes as escape characters.

path = r"C:\Users\User\Documents\file.txt"

Double the backslashes in your string

path = "C:\\Users\\User\\Documents\\file.txt"

Use forward slashes instead of backslashes

path = "C:/Users/User/Documents/file.txt"

Unicode escape sequences can be used in both single-quoted (’…’) and double-quoted ("…") strings in Python. However, it’s important to note that in Python 3, strings are Unicode by default, so you can also include Unicode characters directly in a string without using escape sequences, as long as you use the appropriate character encoding.