I am getting a score of -387% even after standardizing the data.
Here is my code. Maybe there is an error in it.
Air pollution prediction
import os
import numpy as np
import pandas as pd
from sklearn.linear_model import LinearRegression

os.chdir("C:/Users/Santosh/Downloads/Train_air/")
data_set=pd.read_csv("Train.csv")
data_set
# The first five columns are the features; the sixth column is the target.
X=data_set.iloc[:,:5].to_numpy()
X.shape
y=data_set.iloc[:,5].to_numpy()
model=LinearRegression()
# fit() returns the fitted estimator itself, not an error value.
model.fit(X,y)
print(model)
yh=model.predict(X)
yh[:10]
y[:10]
model.score(X,y)  # R^2 on the training data
X_test=pd.read_csv("C:/Users/Santosh/Downloads/test/test.csv")
X_test
y_test=model.predict(X_test)
import os
os.chdir("C:/Users/Santosh/Downloads/Test/")
submission=pd.read_csv("Sample_Submission.csv")
submission["target"]=y_test
submission
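
One thing that may be worth checking: the code above has no standardization step, so it is unclear how the scaling was applied. If the model is fitted on standardized features but then predicts on unscaled ones, R^2 can fall far below zero. The following is a minimal sketch on synthetic data (Train.csv is not available here), using scikit-learn's StandardScaler and LinearRegression in a pipeline so the same scaling is reused at prediction time, and comparing that with the mismatched case. The data, coefficients, and variable names are illustrative assumptions, and this is only one possible cause of a negative score.

```python
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for Train.csv (assumption): 5 features, 1 target.
rng = np.random.default_rng(0)
X = rng.normal(loc=50.0, scale=10.0, size=(200, 5))
true_coef = np.array([1.5, -2.0, 0.5, 0.0, 3.0])
y = X @ true_coef + rng.normal(scale=1.0, size=200)

# Hold out a validation split so the score is not measured on training rows.
X_train, X_val, y_train, y_val = train_test_split(
    X, y, test_size=0.25, random_state=0
)

# Consistent scaling: the pipeline learns the scaling on the training rows
# and applies the same transform inside predict() and score().
model = make_pipeline(StandardScaler(), LinearRegression())
model.fit(X_train, y_train)
r2_pipeline = model.score(X_val, y_val)

# Mismatched scaling: fit on standardized features, score on raw features.
scaler = StandardScaler().fit(X_train)
plain = LinearRegression().fit(scaler.transform(X_train), y_train)
r2_mismatch = plain.score(X_val, y_val)  # X_val is deliberately not transformed

print(f"R^2 with consistent scaling: {r2_pipeline:.3f}")
print(f"R^2 with unscaled validation data: {r2_mismatch:.3f}")
```

With the real data, the same pipeline could be fitted on X and y and then called as model.predict(X_test), so the test features get the scaling learned from the training set.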