Movie rating prediction problem

Sir, I am facing a problem with training data after stopword removal our X_train become a list, and the vectorization function takes the only array as input and when I converting the list into an array it gives an error "Unable to allocate 1.26 GiB for an array with shape (40000,) and data type <U8476 ", and when I pass a list of cleaned x_train into vectorization function it gives an error “Unable to allocate 63.8 MiB for an array with shape (8367123,) and data type int64” but the size of the list is 40,000

suggest me any suitable solution.

hey @Jituk48216 ,
try changing the datatype to int8 or int32 .
As int64 takes more space, and your work can be done on int8 i guess.
If still there is some problem then kindly provide a link to your notebook.

Hope this helps.
Thank You :slightly_smiling_face:.

I hope I’ve cleared your doubt. I ask you to please rate your experience here
Your feedback is very important. It helps us improve our platform and hence provide you
the learning experience you deserve.

On the off chance, you still have some questions or not find the answers satisfactory, you may reopen
the doubt.