Help with Likelihood Function Notation in Logistic Regression











up vote
2
down vote

favorite












Can someone please help me understand below notation I encountered while studying logistic regression? I am pretty sure I lack the mathematical maturity but would like to give it a try.



$l(beta_0,beta_1) = prod_{i:y_{i=1}} p(x_i) prod_{i':y_{i'=0}} (1-p(x_i'))$



where $p(X)$ = $frac{e^{beta_0+beta_1X}}{1+e^{beta_0+beta_1X}}$




  1. What does $prod$ notation stand for?

  2. What does the index $i$ comprise of?

  3. What does the lower bound $i:y_{i=1}$ mean?

  4. What does the first equation mean on the whole?


Please let me know if I missed providing any context information.










share|cite|improve this question


























    up vote
    2
    down vote

    favorite












    Can someone please help me understand below notation I encountered while studying logistic regression? I am pretty sure I lack the mathematical maturity but would like to give it a try.



    $l(beta_0,beta_1) = prod_{i:y_{i=1}} p(x_i) prod_{i':y_{i'=0}} (1-p(x_i'))$



    where $p(X)$ = $frac{e^{beta_0+beta_1X}}{1+e^{beta_0+beta_1X}}$




    1. What does $prod$ notation stand for?

    2. What does the index $i$ comprise of?

    3. What does the lower bound $i:y_{i=1}$ mean?

    4. What does the first equation mean on the whole?


    Please let me know if I missed providing any context information.










    share|cite|improve this question
























      up vote
      2
      down vote

      favorite









      up vote
      2
      down vote

      favorite











      Can someone please help me understand below notation I encountered while studying logistic regression? I am pretty sure I lack the mathematical maturity but would like to give it a try.



      $l(beta_0,beta_1) = prod_{i:y_{i=1}} p(x_i) prod_{i':y_{i'=0}} (1-p(x_i'))$



      where $p(X)$ = $frac{e^{beta_0+beta_1X}}{1+e^{beta_0+beta_1X}}$




      1. What does $prod$ notation stand for?

      2. What does the index $i$ comprise of?

      3. What does the lower bound $i:y_{i=1}$ mean?

      4. What does the first equation mean on the whole?


      Please let me know if I missed providing any context information.










      share|cite|improve this question













      Can someone please help me understand below notation I encountered while studying logistic regression? I am pretty sure I lack the mathematical maturity but would like to give it a try.



      $l(beta_0,beta_1) = prod_{i:y_{i=1}} p(x_i) prod_{i':y_{i'=0}} (1-p(x_i'))$



      where $p(X)$ = $frac{e^{beta_0+beta_1X}}{1+e^{beta_0+beta_1X}}$




      1. What does $prod$ notation stand for?

      2. What does the index $i$ comprise of?

      3. What does the lower bound $i:y_{i=1}$ mean?

      4. What does the first equation mean on the whole?


      Please let me know if I missed providing any context information.







      statistics






      share|cite|improve this question













      share|cite|improve this question











      share|cite|improve this question




      share|cite|improve this question










      asked Nov 13 at 18:34









      DoLare

      111




      111






















          1 Answer
          1






          active

          oldest

          votes

















          up vote
          1
          down vote













          It is not the notation I would choose, but essentially you seem have some observations $x_a,x_b,ldots$ which produce successful responses, which seem to be shown as $y_a=1, y_b=1, ldots$, and some other observations $x^prime_j,x^prime_k,ldots$ which produce unsuccessful responses, which seem to be shown as $y^prime_j=0, y^prime_k=0, ldots$



          The model is based of the probability that $x$ produces a successful response $y=1$ being expressed as $p(x) = frac{e^{beta_0+beta_1x}}{1+e^{beta_0+beta_1x}}$ and so the probability that $x$ produces an unsuccessful response $y=0$ being $1-p(x)$



          In answer to your questions:





          1. $prod$ means take the product over the index, in a similar way to $sum$ meaning the sum

          2. In the first product $i$ seem to represent the index of the successful cases, while the second product $i^prime$ seem to represent the index of the unsuccessful cases


          3. $i:y_i=1$ seems to mean taking the index over the successful cases, $i^prime:y_{i^prime}=0$ seems to mean taking the index over the unsuccessful cases,

          4. The whole expression is saying that the likelihood of the observations is (proportional to) the product of the probabilities of the successful cases being successful and the unsuccessful cases being unsuccessful assuming the probabilities follow the model family. Presumably the next step is to find the $beta_0$, $beta_1$ which maximise this likelihood






          share|cite|improve this answer





















            Your Answer





            StackExchange.ifUsing("editor", function () {
            return StackExchange.using("mathjaxEditing", function () {
            StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix) {
            StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
            });
            });
            }, "mathjax-editing");

            StackExchange.ready(function() {
            var channelOptions = {
            tags: "".split(" "),
            id: "69"
            };
            initTagRenderer("".split(" "), "".split(" "), channelOptions);

            StackExchange.using("externalEditor", function() {
            // Have to fire editor after snippets, if snippets enabled
            if (StackExchange.settings.snippets.snippetsEnabled) {
            StackExchange.using("snippets", function() {
            createEditor();
            });
            }
            else {
            createEditor();
            }
            });

            function createEditor() {
            StackExchange.prepareEditor({
            heartbeatType: 'answer',
            convertImagesToLinks: true,
            noModals: true,
            showLowRepImageUploadWarning: true,
            reputationToPostImages: 10,
            bindNavPrevention: true,
            postfix: "",
            imageUploader: {
            brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
            contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
            allowUrls: true
            },
            noCode: true, onDemand: true,
            discardSelector: ".discard-answer"
            ,immediatelyShowMarkdownHelp:true
            });


            }
            });














             

            draft saved


            draft discarded


















            StackExchange.ready(
            function () {
            StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f2997109%2fhelp-with-likelihood-function-notation-in-logistic-regression%23new-answer', 'question_page');
            }
            );

            Post as a guest















            Required, but never shown

























            1 Answer
            1






            active

            oldest

            votes








            1 Answer
            1






            active

            oldest

            votes









            active

            oldest

            votes






            active

            oldest

            votes








            up vote
            1
            down vote













            It is not the notation I would choose, but essentially you seem have some observations $x_a,x_b,ldots$ which produce successful responses, which seem to be shown as $y_a=1, y_b=1, ldots$, and some other observations $x^prime_j,x^prime_k,ldots$ which produce unsuccessful responses, which seem to be shown as $y^prime_j=0, y^prime_k=0, ldots$



            The model is based of the probability that $x$ produces a successful response $y=1$ being expressed as $p(x) = frac{e^{beta_0+beta_1x}}{1+e^{beta_0+beta_1x}}$ and so the probability that $x$ produces an unsuccessful response $y=0$ being $1-p(x)$



            In answer to your questions:





            1. $prod$ means take the product over the index, in a similar way to $sum$ meaning the sum

            2. In the first product $i$ seem to represent the index of the successful cases, while the second product $i^prime$ seem to represent the index of the unsuccessful cases


            3. $i:y_i=1$ seems to mean taking the index over the successful cases, $i^prime:y_{i^prime}=0$ seems to mean taking the index over the unsuccessful cases,

            4. The whole expression is saying that the likelihood of the observations is (proportional to) the product of the probabilities of the successful cases being successful and the unsuccessful cases being unsuccessful assuming the probabilities follow the model family. Presumably the next step is to find the $beta_0$, $beta_1$ which maximise this likelihood






            share|cite|improve this answer

























              up vote
              1
              down vote













              It is not the notation I would choose, but essentially you seem have some observations $x_a,x_b,ldots$ which produce successful responses, which seem to be shown as $y_a=1, y_b=1, ldots$, and some other observations $x^prime_j,x^prime_k,ldots$ which produce unsuccessful responses, which seem to be shown as $y^prime_j=0, y^prime_k=0, ldots$



              The model is based of the probability that $x$ produces a successful response $y=1$ being expressed as $p(x) = frac{e^{beta_0+beta_1x}}{1+e^{beta_0+beta_1x}}$ and so the probability that $x$ produces an unsuccessful response $y=0$ being $1-p(x)$



              In answer to your questions:





              1. $prod$ means take the product over the index, in a similar way to $sum$ meaning the sum

              2. In the first product $i$ seem to represent the index of the successful cases, while the second product $i^prime$ seem to represent the index of the unsuccessful cases


              3. $i:y_i=1$ seems to mean taking the index over the successful cases, $i^prime:y_{i^prime}=0$ seems to mean taking the index over the unsuccessful cases,

              4. The whole expression is saying that the likelihood of the observations is (proportional to) the product of the probabilities of the successful cases being successful and the unsuccessful cases being unsuccessful assuming the probabilities follow the model family. Presumably the next step is to find the $beta_0$, $beta_1$ which maximise this likelihood






              share|cite|improve this answer























                up vote
                1
                down vote










                up vote
                1
                down vote









                It is not the notation I would choose, but essentially you seem have some observations $x_a,x_b,ldots$ which produce successful responses, which seem to be shown as $y_a=1, y_b=1, ldots$, and some other observations $x^prime_j,x^prime_k,ldots$ which produce unsuccessful responses, which seem to be shown as $y^prime_j=0, y^prime_k=0, ldots$



                The model is based of the probability that $x$ produces a successful response $y=1$ being expressed as $p(x) = frac{e^{beta_0+beta_1x}}{1+e^{beta_0+beta_1x}}$ and so the probability that $x$ produces an unsuccessful response $y=0$ being $1-p(x)$



                In answer to your questions:





                1. $prod$ means take the product over the index, in a similar way to $sum$ meaning the sum

                2. In the first product $i$ seem to represent the index of the successful cases, while the second product $i^prime$ seem to represent the index of the unsuccessful cases


                3. $i:y_i=1$ seems to mean taking the index over the successful cases, $i^prime:y_{i^prime}=0$ seems to mean taking the index over the unsuccessful cases,

                4. The whole expression is saying that the likelihood of the observations is (proportional to) the product of the probabilities of the successful cases being successful and the unsuccessful cases being unsuccessful assuming the probabilities follow the model family. Presumably the next step is to find the $beta_0$, $beta_1$ which maximise this likelihood






                share|cite|improve this answer












                It is not the notation I would choose, but essentially you seem have some observations $x_a,x_b,ldots$ which produce successful responses, which seem to be shown as $y_a=1, y_b=1, ldots$, and some other observations $x^prime_j,x^prime_k,ldots$ which produce unsuccessful responses, which seem to be shown as $y^prime_j=0, y^prime_k=0, ldots$



                The model is based of the probability that $x$ produces a successful response $y=1$ being expressed as $p(x) = frac{e^{beta_0+beta_1x}}{1+e^{beta_0+beta_1x}}$ and so the probability that $x$ produces an unsuccessful response $y=0$ being $1-p(x)$



                In answer to your questions:





                1. $prod$ means take the product over the index, in a similar way to $sum$ meaning the sum

                2. In the first product $i$ seem to represent the index of the successful cases, while the second product $i^prime$ seem to represent the index of the unsuccessful cases


                3. $i:y_i=1$ seems to mean taking the index over the successful cases, $i^prime:y_{i^prime}=0$ seems to mean taking the index over the unsuccessful cases,

                4. The whole expression is saying that the likelihood of the observations is (proportional to) the product of the probabilities of the successful cases being successful and the unsuccessful cases being unsuccessful assuming the probabilities follow the model family. Presumably the next step is to find the $beta_0$, $beta_1$ which maximise this likelihood







                share|cite|improve this answer












                share|cite|improve this answer



                share|cite|improve this answer










                answered Nov 13 at 18:59









                Henry

                96.7k474154




                96.7k474154






























                     

                    draft saved


                    draft discarded



















































                     


                    draft saved


                    draft discarded














                    StackExchange.ready(
                    function () {
                    StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f2997109%2fhelp-with-likelihood-function-notation-in-logistic-regression%23new-answer', 'question_page');
                    }
                    );

                    Post as a guest















                    Required, but never shown





















































                    Required, but never shown














                    Required, but never shown












                    Required, but never shown







                    Required, but never shown

































                    Required, but never shown














                    Required, but never shown












                    Required, but never shown







                    Required, but never shown







                    Popular posts from this blog

                    How to change which sound is reproduced for terminal bell?

                    Can I use Tabulator js library in my java Spring + Thymeleaf project?

                    Title Spacing in Bjornstrup Chapter, Removing Chapter Number From Contents