Help with Likelihood Function Notation in Logistic Regression
up vote
2
down vote
favorite
Can someone please help me understand below notation I encountered while studying logistic regression? I am pretty sure I lack the mathematical maturity but would like to give it a try.
$l(beta_0,beta_1) = prod_{i:y_{i=1}} p(x_i) prod_{i':y_{i'=0}} (1-p(x_i'))$
where $p(X)$ = $frac{e^{beta_0+beta_1X}}{1+e^{beta_0+beta_1X}}$
- What does $prod$ notation stand for?
- What does the index $i$ comprise of?
- What does the lower bound $i:y_{i=1}$ mean?
- What does the first equation mean on the whole?
Please let me know if I missed providing any context information.
statistics
add a comment |
up vote
2
down vote
favorite
Can someone please help me understand below notation I encountered while studying logistic regression? I am pretty sure I lack the mathematical maturity but would like to give it a try.
$l(beta_0,beta_1) = prod_{i:y_{i=1}} p(x_i) prod_{i':y_{i'=0}} (1-p(x_i'))$
where $p(X)$ = $frac{e^{beta_0+beta_1X}}{1+e^{beta_0+beta_1X}}$
- What does $prod$ notation stand for?
- What does the index $i$ comprise of?
- What does the lower bound $i:y_{i=1}$ mean?
- What does the first equation mean on the whole?
Please let me know if I missed providing any context information.
statistics
add a comment |
up vote
2
down vote
favorite
up vote
2
down vote
favorite
Can someone please help me understand below notation I encountered while studying logistic regression? I am pretty sure I lack the mathematical maturity but would like to give it a try.
$l(beta_0,beta_1) = prod_{i:y_{i=1}} p(x_i) prod_{i':y_{i'=0}} (1-p(x_i'))$
where $p(X)$ = $frac{e^{beta_0+beta_1X}}{1+e^{beta_0+beta_1X}}$
- What does $prod$ notation stand for?
- What does the index $i$ comprise of?
- What does the lower bound $i:y_{i=1}$ mean?
- What does the first equation mean on the whole?
Please let me know if I missed providing any context information.
statistics
Can someone please help me understand below notation I encountered while studying logistic regression? I am pretty sure I lack the mathematical maturity but would like to give it a try.
$l(beta_0,beta_1) = prod_{i:y_{i=1}} p(x_i) prod_{i':y_{i'=0}} (1-p(x_i'))$
where $p(X)$ = $frac{e^{beta_0+beta_1X}}{1+e^{beta_0+beta_1X}}$
- What does $prod$ notation stand for?
- What does the index $i$ comprise of?
- What does the lower bound $i:y_{i=1}$ mean?
- What does the first equation mean on the whole?
Please let me know if I missed providing any context information.
statistics
statistics
asked Nov 13 at 18:34
DoLare
111
111
add a comment |
add a comment |
1 Answer
1
active
oldest
votes
up vote
1
down vote
It is not the notation I would choose, but essentially you seem have some observations $x_a,x_b,ldots$ which produce successful responses, which seem to be shown as $y_a=1, y_b=1, ldots$, and some other observations $x^prime_j,x^prime_k,ldots$ which produce unsuccessful responses, which seem to be shown as $y^prime_j=0, y^prime_k=0, ldots$
The model is based of the probability that $x$ produces a successful response $y=1$ being expressed as $p(x) = frac{e^{beta_0+beta_1x}}{1+e^{beta_0+beta_1x}}$ and so the probability that $x$ produces an unsuccessful response $y=0$ being $1-p(x)$
In answer to your questions:
$prod$ means take the product over the index, in a similar way to $sum$ meaning the sum- In the first product $i$ seem to represent the index of the successful cases, while the second product $i^prime$ seem to represent the index of the unsuccessful cases
$i:y_i=1$ seems to mean taking the index over the successful cases, $i^prime:y_{i^prime}=0$ seems to mean taking the index over the unsuccessful cases,- The whole expression is saying that the likelihood of the observations is (proportional to) the product of the probabilities of the successful cases being successful and the unsuccessful cases being unsuccessful assuming the probabilities follow the model family. Presumably the next step is to find the $beta_0$, $beta_1$ which maximise this likelihood
add a comment |
1 Answer
1
active
oldest
votes
1 Answer
1
active
oldest
votes
active
oldest
votes
active
oldest
votes
up vote
1
down vote
It is not the notation I would choose, but essentially you seem have some observations $x_a,x_b,ldots$ which produce successful responses, which seem to be shown as $y_a=1, y_b=1, ldots$, and some other observations $x^prime_j,x^prime_k,ldots$ which produce unsuccessful responses, which seem to be shown as $y^prime_j=0, y^prime_k=0, ldots$
The model is based of the probability that $x$ produces a successful response $y=1$ being expressed as $p(x) = frac{e^{beta_0+beta_1x}}{1+e^{beta_0+beta_1x}}$ and so the probability that $x$ produces an unsuccessful response $y=0$ being $1-p(x)$
In answer to your questions:
$prod$ means take the product over the index, in a similar way to $sum$ meaning the sum- In the first product $i$ seem to represent the index of the successful cases, while the second product $i^prime$ seem to represent the index of the unsuccessful cases
$i:y_i=1$ seems to mean taking the index over the successful cases, $i^prime:y_{i^prime}=0$ seems to mean taking the index over the unsuccessful cases,- The whole expression is saying that the likelihood of the observations is (proportional to) the product of the probabilities of the successful cases being successful and the unsuccessful cases being unsuccessful assuming the probabilities follow the model family. Presumably the next step is to find the $beta_0$, $beta_1$ which maximise this likelihood
add a comment |
up vote
1
down vote
It is not the notation I would choose, but essentially you seem have some observations $x_a,x_b,ldots$ which produce successful responses, which seem to be shown as $y_a=1, y_b=1, ldots$, and some other observations $x^prime_j,x^prime_k,ldots$ which produce unsuccessful responses, which seem to be shown as $y^prime_j=0, y^prime_k=0, ldots$
The model is based of the probability that $x$ produces a successful response $y=1$ being expressed as $p(x) = frac{e^{beta_0+beta_1x}}{1+e^{beta_0+beta_1x}}$ and so the probability that $x$ produces an unsuccessful response $y=0$ being $1-p(x)$
In answer to your questions:
$prod$ means take the product over the index, in a similar way to $sum$ meaning the sum- In the first product $i$ seem to represent the index of the successful cases, while the second product $i^prime$ seem to represent the index of the unsuccessful cases
$i:y_i=1$ seems to mean taking the index over the successful cases, $i^prime:y_{i^prime}=0$ seems to mean taking the index over the unsuccessful cases,- The whole expression is saying that the likelihood of the observations is (proportional to) the product of the probabilities of the successful cases being successful and the unsuccessful cases being unsuccessful assuming the probabilities follow the model family. Presumably the next step is to find the $beta_0$, $beta_1$ which maximise this likelihood
add a comment |
up vote
1
down vote
up vote
1
down vote
It is not the notation I would choose, but essentially you seem have some observations $x_a,x_b,ldots$ which produce successful responses, which seem to be shown as $y_a=1, y_b=1, ldots$, and some other observations $x^prime_j,x^prime_k,ldots$ which produce unsuccessful responses, which seem to be shown as $y^prime_j=0, y^prime_k=0, ldots$
The model is based of the probability that $x$ produces a successful response $y=1$ being expressed as $p(x) = frac{e^{beta_0+beta_1x}}{1+e^{beta_0+beta_1x}}$ and so the probability that $x$ produces an unsuccessful response $y=0$ being $1-p(x)$
In answer to your questions:
$prod$ means take the product over the index, in a similar way to $sum$ meaning the sum- In the first product $i$ seem to represent the index of the successful cases, while the second product $i^prime$ seem to represent the index of the unsuccessful cases
$i:y_i=1$ seems to mean taking the index over the successful cases, $i^prime:y_{i^prime}=0$ seems to mean taking the index over the unsuccessful cases,- The whole expression is saying that the likelihood of the observations is (proportional to) the product of the probabilities of the successful cases being successful and the unsuccessful cases being unsuccessful assuming the probabilities follow the model family. Presumably the next step is to find the $beta_0$, $beta_1$ which maximise this likelihood
It is not the notation I would choose, but essentially you seem have some observations $x_a,x_b,ldots$ which produce successful responses, which seem to be shown as $y_a=1, y_b=1, ldots$, and some other observations $x^prime_j,x^prime_k,ldots$ which produce unsuccessful responses, which seem to be shown as $y^prime_j=0, y^prime_k=0, ldots$
The model is based of the probability that $x$ produces a successful response $y=1$ being expressed as $p(x) = frac{e^{beta_0+beta_1x}}{1+e^{beta_0+beta_1x}}$ and so the probability that $x$ produces an unsuccessful response $y=0$ being $1-p(x)$
In answer to your questions:
$prod$ means take the product over the index, in a similar way to $sum$ meaning the sum- In the first product $i$ seem to represent the index of the successful cases, while the second product $i^prime$ seem to represent the index of the unsuccessful cases
$i:y_i=1$ seems to mean taking the index over the successful cases, $i^prime:y_{i^prime}=0$ seems to mean taking the index over the unsuccessful cases,- The whole expression is saying that the likelihood of the observations is (proportional to) the product of the probabilities of the successful cases being successful and the unsuccessful cases being unsuccessful assuming the probabilities follow the model family. Presumably the next step is to find the $beta_0$, $beta_1$ which maximise this likelihood
answered Nov 13 at 18:59
Henry
96.7k474154
96.7k474154
add a comment |
add a comment |
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f2997109%2fhelp-with-likelihood-function-notation-in-logistic-regression%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown