Error while trying to use --boundary-query argument in sqoop import











up vote
0
down vote

favorite












I am learning sqoop on my own and tried to run the below mentioned code to retrieve the first 3000 records from the database and evenly split by primary key emp_no



 sqoop import 

--connect jdbc:mysql://localhost/employees

--username root

-P

--query 'select * from employees WHERE $CONDITIONS ORDER BY emp_no LIMIT 3000'

--split-by emp_no

-m 3

--target-dir sqoop/import_data/employee_db_import

--delete-target-dir


The above statements yielded evenly distributed results 1000 records per mapper.



Now for further learning I added the --boundary-query argument as



 --boundary-query 'select MIN(emp_no),MAX(emp_no) from employees' 


to the above statement and the map reduce job is now reading 9000 records from the database. I want to know why is this happening ?










share|improve this question


























    up vote
    0
    down vote

    favorite












    I am learning sqoop on my own and tried to run the below mentioned code to retrieve the first 3000 records from the database and evenly split by primary key emp_no



     sqoop import 

    --connect jdbc:mysql://localhost/employees

    --username root

    -P

    --query 'select * from employees WHERE $CONDITIONS ORDER BY emp_no LIMIT 3000'

    --split-by emp_no

    -m 3

    --target-dir sqoop/import_data/employee_db_import

    --delete-target-dir


    The above statements yielded evenly distributed results 1000 records per mapper.



    Now for further learning I added the --boundary-query argument as



     --boundary-query 'select MIN(emp_no),MAX(emp_no) from employees' 


    to the above statement and the map reduce job is now reading 9000 records from the database. I want to know why is this happening ?










    share|improve this question
























      up vote
      0
      down vote

      favorite









      up vote
      0
      down vote

      favorite











      I am learning sqoop on my own and tried to run the below mentioned code to retrieve the first 3000 records from the database and evenly split by primary key emp_no



       sqoop import 

      --connect jdbc:mysql://localhost/employees

      --username root

      -P

      --query 'select * from employees WHERE $CONDITIONS ORDER BY emp_no LIMIT 3000'

      --split-by emp_no

      -m 3

      --target-dir sqoop/import_data/employee_db_import

      --delete-target-dir


      The above statements yielded evenly distributed results 1000 records per mapper.



      Now for further learning I added the --boundary-query argument as



       --boundary-query 'select MIN(emp_no),MAX(emp_no) from employees' 


      to the above statement and the map reduce job is now reading 9000 records from the database. I want to know why is this happening ?










      share|improve this question













      I am learning sqoop on my own and tried to run the below mentioned code to retrieve the first 3000 records from the database and evenly split by primary key emp_no



       sqoop import 

      --connect jdbc:mysql://localhost/employees

      --username root

      -P

      --query 'select * from employees WHERE $CONDITIONS ORDER BY emp_no LIMIT 3000'

      --split-by emp_no

      -m 3

      --target-dir sqoop/import_data/employee_db_import

      --delete-target-dir


      The above statements yielded evenly distributed results 1000 records per mapper.



      Now for further learning I added the --boundary-query argument as



       --boundary-query 'select MIN(emp_no),MAX(emp_no) from employees' 


      to the above statement and the map reduce job is now reading 9000 records from the database. I want to know why is this happening ?







      mysql hadoop mapreduce sqoop






      share|improve this question













      share|improve this question











      share|improve this question




      share|improve this question










      asked yesterday









      Sarvagya Dubey

      699




      699





























          active

          oldest

          votes











          Your Answer






          StackExchange.ifUsing("editor", function () {
          StackExchange.using("externalEditor", function () {
          StackExchange.using("snippets", function () {
          StackExchange.snippets.init();
          });
          });
          }, "code-snippets");

          StackExchange.ready(function() {
          var channelOptions = {
          tags: "".split(" "),
          id: "1"
          };
          initTagRenderer("".split(" "), "".split(" "), channelOptions);

          StackExchange.using("externalEditor", function() {
          // Have to fire editor after snippets, if snippets enabled
          if (StackExchange.settings.snippets.snippetsEnabled) {
          StackExchange.using("snippets", function() {
          createEditor();
          });
          }
          else {
          createEditor();
          }
          });

          function createEditor() {
          StackExchange.prepareEditor({
          heartbeatType: 'answer',
          convertImagesToLinks: true,
          noModals: true,
          showLowRepImageUploadWarning: true,
          reputationToPostImages: 10,
          bindNavPrevention: true,
          postfix: "",
          imageUploader: {
          brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
          contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
          allowUrls: true
          },
          onDemand: true,
          discardSelector: ".discard-answer"
          ,immediatelyShowMarkdownHelp:true
          });


          }
          });














           

          draft saved


          draft discarded


















          StackExchange.ready(
          function () {
          StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53265727%2ferror-while-trying-to-use-boundary-query-argument-in-sqoop-import%23new-answer', 'question_page');
          }
          );

          Post as a guest





































          active

          oldest

          votes













          active

          oldest

          votes









          active

          oldest

          votes






          active

          oldest

          votes
















           

          draft saved


          draft discarded



















































           


          draft saved


          draft discarded














          StackExchange.ready(
          function () {
          StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53265727%2ferror-while-trying-to-use-boundary-query-argument-in-sqoop-import%23new-answer', 'question_page');
          }
          );

          Post as a guest




















































































          Popular posts from this blog

          How to change which sound is reproduced for terminal bell?

          Can I use Tabulator js library in my java Spring + Thymeleaf project?

          Title Spacing in Bjornstrup Chapter, Removing Chapter Number From Contents