it is showing error in
pd.read_csv(“ml-100k”,sep="|",header=None)
Error in code while importing dataset
C:\Users\ayush\AppData\Local\Temp\ipykernel_15400\3251950743.py:1: ParserWarning: Falling back to the ‘python’ engine because the ‘c’ engine does not support regex separators (separators > 1 char and different from ‘\s+’ are interpreted as regex); you can avoid this warning by specifying engine=‘python’. pd.read_csv(“ml-100k/u.item”,sep="|",header=None) --------------------------------------------------------------------------- UnicodeDecodeError Traceback (most recent call last) Cell In[20], line 1 ----> 1 pd.read_csv(“ml-100k/u.item”,sep="|",header=None) File c:\Users\ayush\AppData\Local\Programs\Python\Python311\Lib\site-packages\pandas\io\parsers\readers.py:912, in read_csv(filepath_or_buffer, sep, delimiter, header, names, index_col, usecols, dtype, engine, converters, true_values, false_values, skipinitialspace, skiprows, skipfooter, nrows, na_values, keep_default_na, na_filter, verbose, skip_blank_lines, parse_dates, infer_datetime_format, keep_date_col, date_parser, date_format, dayfirst, cache_dates, iterator, chunksize, compression, thousands, decimal, lineterminator, quotechar, quoting, doublequote, escapechar, comment, encoding, encoding_errors, dialect, on_bad_lines, delim_whitespace, low_memory, memory_map, float_precision, storage_options, dtype_backend) 899 kwds_defaults = _refine_defaults_read( 900 dialect, 901 delimiter, (…) 908 dtype_backend=dtype_backend, 909 ) 910 kwds.update(kwds_defaults) --> 912 return _read(filepath_or_buffer, kwds) File c:\Users\ayush\AppData\Local\Programs\Python\Python311\Lib\site-packages\pandas\io\parsers\readers.py:583, in _read(filepath_or_buffer, kwds) 580 return parser 582 with parser: --> 583 return parser.read(nrows) File c:\Users\ayush\AppData\Local\Programs\Python\Python311\Lib\site-packages\pandas\io\parsers\readers.py:1704, in TextFileReader.read(self, nrows) 1697 nrows = validate_integer(“nrows”, nrows) 1698 try: 1699 # error: “ParserBase” has no attribute “read” 1700 ( … 236 yield pat.split(line.strip()) File :322, in decode(self, input, final) UnicodeDecodeError: ‘utf-8’ codec can’t decode byte 0xe9 in position 2892: invalid continuation byte Output is truncated. View as a scrollable element or open in a text editor. Adjust cell output settings… this is the error
basically my separator is showing error
Hello @ayushdubeymanu_24c3c99b82b79971,
The sep= parameter is passed the  read the csv in which the data is separated by that particular separator, in the video the separator in csv file is tab ‘|’ but isn yours it could be comma ‘,’. So pls check your csv file and then pass sep= value.
I hope I’ve cleared your doubt. I ask you to please rate your experience here
Your feedback is very important. It helps us improve our platform and hence provide you
the learning experience you deserve.
On the off chance, you still have some questions or not find the answers satisfactory, you may reopen
the doubt.
