Histogram bin size











up vote
0
down vote

favorite












I have a code like this and I am wondering why my bin size of the two plotted graphs is different?



import matplotlib.pyplot as pyplot
bins=15
pyplot.rcParams["figure.figsize"] = (10,10)

#echte_Ladezeit
pyplot.hist(Y_test, bins, alpha=1, label='Y_test; orange Dateien',
color='orange', weights = np.ones_like(Y_test)/float(len(Y_test)))
pyplot.hist(Y_train, bins, alpha=1, label='Y_train; grüne Dateien',
color='green', weights = np.ones_like(Y_train)/float(len(Y_train)))
pyplot.title('Verteilung echte_Ladezeit')
pyplot.xlabel('echte_Ladezeit')
pyplot.ylabel('Häufigkeit [%]')
pyplot.legend(loc='upper right')
pyplot.show()


actually the marked width of the orange and the green one should be the same right? Do I have any mistake in my code?
enter image description here










share|improve this question


























    up vote
    0
    down vote

    favorite












    I have a code like this and I am wondering why my bin size of the two plotted graphs is different?



    import matplotlib.pyplot as pyplot
    bins=15
    pyplot.rcParams["figure.figsize"] = (10,10)

    #echte_Ladezeit
    pyplot.hist(Y_test, bins, alpha=1, label='Y_test; orange Dateien',
    color='orange', weights = np.ones_like(Y_test)/float(len(Y_test)))
    pyplot.hist(Y_train, bins, alpha=1, label='Y_train; grüne Dateien',
    color='green', weights = np.ones_like(Y_train)/float(len(Y_train)))
    pyplot.title('Verteilung echte_Ladezeit')
    pyplot.xlabel('echte_Ladezeit')
    pyplot.ylabel('Häufigkeit [%]')
    pyplot.legend(loc='upper right')
    pyplot.show()


    actually the marked width of the orange and the green one should be the same right? Do I have any mistake in my code?
    enter image description here










    share|improve this question
























      up vote
      0
      down vote

      favorite









      up vote
      0
      down vote

      favorite











      I have a code like this and I am wondering why my bin size of the two plotted graphs is different?



      import matplotlib.pyplot as pyplot
      bins=15
      pyplot.rcParams["figure.figsize"] = (10,10)

      #echte_Ladezeit
      pyplot.hist(Y_test, bins, alpha=1, label='Y_test; orange Dateien',
      color='orange', weights = np.ones_like(Y_test)/float(len(Y_test)))
      pyplot.hist(Y_train, bins, alpha=1, label='Y_train; grüne Dateien',
      color='green', weights = np.ones_like(Y_train)/float(len(Y_train)))
      pyplot.title('Verteilung echte_Ladezeit')
      pyplot.xlabel('echte_Ladezeit')
      pyplot.ylabel('Häufigkeit [%]')
      pyplot.legend(loc='upper right')
      pyplot.show()


      actually the marked width of the orange and the green one should be the same right? Do I have any mistake in my code?
      enter image description here










      share|improve this question













      I have a code like this and I am wondering why my bin size of the two plotted graphs is different?



      import matplotlib.pyplot as pyplot
      bins=15
      pyplot.rcParams["figure.figsize"] = (10,10)

      #echte_Ladezeit
      pyplot.hist(Y_test, bins, alpha=1, label='Y_test; orange Dateien',
      color='orange', weights = np.ones_like(Y_test)/float(len(Y_test)))
      pyplot.hist(Y_train, bins, alpha=1, label='Y_train; grüne Dateien',
      color='green', weights = np.ones_like(Y_train)/float(len(Y_train)))
      pyplot.title('Verteilung echte_Ladezeit')
      pyplot.xlabel('echte_Ladezeit')
      pyplot.ylabel('Häufigkeit [%]')
      pyplot.legend(loc='upper right')
      pyplot.show()


      actually the marked width of the orange and the green one should be the same right? Do I have any mistake in my code?
      enter image description here







      python pandas histogram






      share|improve this question













      share|improve this question











      share|improve this question




      share|improve this question










      asked Nov 13 at 9:17









      raffa_sa

      1187




      1187
























          1 Answer
          1






          active

          oldest

          votes

















          up vote
          3
          down vote













          Your code contains pyplot.hist(..., bins, ...) where bins = 15. This means 15 bins equally spaced between max and min values. Max and min values are different for two datasets so you get different sets of 15 bins. If you want to get bins of equal width for every dataset then you have at least two options.




          1. Normalize datasets - max and min values should be the same for both datasets.


          2. Define bins as a sequence (for example, list(range(0, 40000 + 1, 5000))) as described here.







          share|improve this answer























            Your Answer






            StackExchange.ifUsing("editor", function () {
            StackExchange.using("externalEditor", function () {
            StackExchange.using("snippets", function () {
            StackExchange.snippets.init();
            });
            });
            }, "code-snippets");

            StackExchange.ready(function() {
            var channelOptions = {
            tags: "".split(" "),
            id: "1"
            };
            initTagRenderer("".split(" "), "".split(" "), channelOptions);

            StackExchange.using("externalEditor", function() {
            // Have to fire editor after snippets, if snippets enabled
            if (StackExchange.settings.snippets.snippetsEnabled) {
            StackExchange.using("snippets", function() {
            createEditor();
            });
            }
            else {
            createEditor();
            }
            });

            function createEditor() {
            StackExchange.prepareEditor({
            heartbeatType: 'answer',
            convertImagesToLinks: true,
            noModals: true,
            showLowRepImageUploadWarning: true,
            reputationToPostImages: 10,
            bindNavPrevention: true,
            postfix: "",
            imageUploader: {
            brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
            contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
            allowUrls: true
            },
            onDemand: true,
            discardSelector: ".discard-answer"
            ,immediatelyShowMarkdownHelp:true
            });


            }
            });














             

            draft saved


            draft discarded


















            StackExchange.ready(
            function () {
            StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53277555%2fhistogram-bin-size%23new-answer', 'question_page');
            }
            );

            Post as a guest















            Required, but never shown

























            1 Answer
            1






            active

            oldest

            votes








            1 Answer
            1






            active

            oldest

            votes









            active

            oldest

            votes






            active

            oldest

            votes








            up vote
            3
            down vote













            Your code contains pyplot.hist(..., bins, ...) where bins = 15. This means 15 bins equally spaced between max and min values. Max and min values are different for two datasets so you get different sets of 15 bins. If you want to get bins of equal width for every dataset then you have at least two options.




            1. Normalize datasets - max and min values should be the same for both datasets.


            2. Define bins as a sequence (for example, list(range(0, 40000 + 1, 5000))) as described here.







            share|improve this answer



























              up vote
              3
              down vote













              Your code contains pyplot.hist(..., bins, ...) where bins = 15. This means 15 bins equally spaced between max and min values. Max and min values are different for two datasets so you get different sets of 15 bins. If you want to get bins of equal width for every dataset then you have at least two options.




              1. Normalize datasets - max and min values should be the same for both datasets.


              2. Define bins as a sequence (for example, list(range(0, 40000 + 1, 5000))) as described here.







              share|improve this answer

























                up vote
                3
                down vote










                up vote
                3
                down vote









                Your code contains pyplot.hist(..., bins, ...) where bins = 15. This means 15 bins equally spaced between max and min values. Max and min values are different for two datasets so you get different sets of 15 bins. If you want to get bins of equal width for every dataset then you have at least two options.




                1. Normalize datasets - max and min values should be the same for both datasets.


                2. Define bins as a sequence (for example, list(range(0, 40000 + 1, 5000))) as described here.







                share|improve this answer














                Your code contains pyplot.hist(..., bins, ...) where bins = 15. This means 15 bins equally spaced between max and min values. Max and min values are different for two datasets so you get different sets of 15 bins. If you want to get bins of equal width for every dataset then you have at least two options.




                1. Normalize datasets - max and min values should be the same for both datasets.


                2. Define bins as a sequence (for example, list(range(0, 40000 + 1, 5000))) as described here.








                share|improve this answer














                share|improve this answer



                share|improve this answer








                edited Nov 13 at 9:43









                Mohamed Thasin ah

                3,25831237




                3,25831237










                answered Nov 13 at 9:42









                Poolka

                1,024128




                1,024128






























                     

                    draft saved


                    draft discarded



















































                     


                    draft saved


                    draft discarded














                    StackExchange.ready(
                    function () {
                    StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53277555%2fhistogram-bin-size%23new-answer', 'question_page');
                    }
                    );

                    Post as a guest















                    Required, but never shown





















































                    Required, but never shown














                    Required, but never shown












                    Required, but never shown







                    Required, but never shown

































                    Required, but never shown














                    Required, but never shown












                    Required, but never shown







                    Required, but never shown







                    Popular posts from this blog

                    How to change which sound is reproduced for terminal bell?

                    Can I use Tabulator js library in my java Spring + Thymeleaf project?

                    Title Spacing in Bjornstrup Chapter, Removing Chapter Number From Contents