Prateek bhaiya has used the negative of the log likelihood in the code but has used gradient ascent, and I am confused. Shouldn't we use gradient descent to minimize the negative of the log likelihood?
Log likelihood, problem in concept
Yes @sagartiwari1711, we need to use gradient descent, and that is why the negative sign is used in the code.
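For reference, the loss being talked about is the negative log likelihood of logistic regression. A minimal sketch of it (assuming the standard sigmoid hypothesis from the video; this is not the course code):

import numpy as np

def neg_log_likelihood(y_true, x, w, b):
    # h(x) = sigmoid(w . x + b), the usual logistic regression hypothesis
    hx = 1.0 / (1.0 + np.exp(-(x.dot(w) + b)))
    # average of -[ y*log(h) + (1-y)*log(1-h) ] over all samples;
    # minimizing this with gradient DESCENT is the same thing as
    # maximizing the log likelihood with gradient ASCENT
    return -np.mean(y_true * np.log(hx) + (1 - y_true) * np.log(1 - hx))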
Hope this helps
Feel free to ask me again anytime.
But the mentor has used a -ve sign in the loss function and then used the gradient ascent update:
w = w + learning_rate * grad_w
b = b + learning_rate * grad_b
If you look at the code again, grad_w and grad_b are multiplied by (-1) inside the get_grads function, so plugging them into that plus-sign update effectively turns it into
w = w - (learning_rate * grad_w)
b = b - (learning_rate * grad_b)
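Here is a quick numerical check of the equivalence (toy numbers, not the course code): gradient ascent on the log likelihood and gradient descent on the negative log likelihood produce the exact same update, because the two gradients only differ by a sign.

import numpy as np

# toy data, just for illustration
x = np.array([[1.0, 2.0], [0.5, -1.0]])
y_true = np.array([1.0, 0.0])
w = np.zeros(2)
b = 0.0
learning_rate = 0.1

hx = 1.0 / (1.0 + np.exp(-(x.dot(w) + b)))       # sigmoid hypothesis
grad_ll = (y_true - hx).dot(x) / x.shape[0]      # gradient of the log likelihood
grad_nll = -grad_ll                              # gradient of the negative log likelihood

w_ascent = w + learning_rate * grad_ll           # climb the log likelihood
w_descent = w - learning_rate * grad_nll         # descend the negative log likelihood

print(np.allclose(w_ascent, w_descent))          # True -> identical updates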
It's + only in the update, so where is the -1?
I don't understand.
Look at this function in the code:

def get_grads(y_true, x, w, b):
    …
    …
    …
    grad_w += (-1) * (y_true[i] - hx) * x[i]
    grad_b += (-1) * (y_true[i] - hx)
    …
    …
    return [grad_w, grad_b]
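If you fill in the elided lines, that version would look roughly like this (my sketch only, mirroring the repo code posted below but keeping the -1; np is numpy and hypothesis is the same helper used there):

def get_grads(y_true, x, w, b):
    grad_w = np.zeros(w.shape)
    grad_b = 0.0
    m = x.shape[0]
    for i in range(m):
        hx = hypothesis(x[i], w, b)
        # note the (-1): this accumulates the gradient of the NEGATIVE log likelihood
        grad_w += (-1) * (y_true[i] - hx) * x[i]
        grad_b += (-1) * (y_true[i] - hx)
    grad_w /= m
    grad_b /= m
    return [grad_w, grad_b]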
Here is the code from the Coding Blocks repo; bhaiya put the -1 in and edited it later in the video:
def get_grads(y_true, x, w, b):
    grad_w = np.zeros(w.shape)
    grad_b = 0.0
    m = x.shape[0]
    for i in range(m):
        hx = hypothesis(x[i], w, b)
        grad_w += (y_true[i] - hx) * x[i]
        grad_b += (y_true[i] - hx)
    grad_w /= m
    grad_b /= m
    return [grad_w, grad_b]
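If it helps, here is a small sketch of how that function is used for training. The hypothesis shown is my assumption of the usual sigmoid (it is not part of the snippet above), and learning_rate and the iteration count are placeholders, not the values from the video:

import numpy as np

def hypothesis(x, w, b):
    # assumed: standard sigmoid hypothesis for logistic regression
    return 1.0 / (1.0 + np.exp(-(np.dot(w, x) + b)))

def train(x, y_true, learning_rate=0.1, iterations=300):
    w = np.zeros(x.shape[1])
    b = 0.0
    for _ in range(iterations):
        grad_w, grad_b = get_grads(y_true, x, w, b)
        # get_grads above returns the gradient of the log likelihood (no -1),
        # so the update climbs it: gradient ascent, which is the same thing
        # as gradient descent on the negative log likelihood
        w = w + learning_rate * grad_w
        b = b + learning_rate * grad_b
    return w, b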
So is your doubt clear now? The code in the video is correct.
Bhaiya updated it at 9:12 in the video itself.
Please look at the chat in your inbox.