What are the numbers in Tesseract box file?
I cannot for the love of me find any documentation about how Tesseract box files work, and what the coordinates represent.
For instance, I'm getting:
T 2768 165 2789 191 0
The first token is obviously the character. I know that Tesseract uses bottom-left. 2768
should therefore be the bottom. The 4th token (2789
) seems to be the top. I don't get what the 3rd (165
), 5th (191
), and 6th (0
) tokens are. 165
and 191
are incorrect as left/right coordinates, and 0
I have no idea what it refers to.
Can anyone help me? Are these pixel coordinates, or do I have to factor in the DPI of the image?
Thanks!
image-processing ocr tesseract
add a comment |
I cannot for the love of me find any documentation about how Tesseract box files work, and what the coordinates represent.
For instance, I'm getting:
T 2768 165 2789 191 0
The first token is obviously the character. I know that Tesseract uses bottom-left. 2768
should therefore be the bottom. The 4th token (2789
) seems to be the top. I don't get what the 3rd (165
), 5th (191
), and 6th (0
) tokens are. 165
and 191
are incorrect as left/right coordinates, and 0
I have no idea what it refers to.
Can anyone help me? Are these pixel coordinates, or do I have to factor in the DPI of the image?
Thanks!
image-processing ocr tesseract
add a comment |
I cannot for the love of me find any documentation about how Tesseract box files work, and what the coordinates represent.
For instance, I'm getting:
T 2768 165 2789 191 0
The first token is obviously the character. I know that Tesseract uses bottom-left. 2768
should therefore be the bottom. The 4th token (2789
) seems to be the top. I don't get what the 3rd (165
), 5th (191
), and 6th (0
) tokens are. 165
and 191
are incorrect as left/right coordinates, and 0
I have no idea what it refers to.
Can anyone help me? Are these pixel coordinates, or do I have to factor in the DPI of the image?
Thanks!
image-processing ocr tesseract
I cannot for the love of me find any documentation about how Tesseract box files work, and what the coordinates represent.
For instance, I'm getting:
T 2768 165 2789 191 0
The first token is obviously the character. I know that Tesseract uses bottom-left. 2768
should therefore be the bottom. The 4th token (2789
) seems to be the top. I don't get what the 3rd (165
), 5th (191
), and 6th (0
) tokens are. 165
and 191
are incorrect as left/right coordinates, and 0
I have no idea what it refers to.
Can anyone help me? Are these pixel coordinates, or do I have to factor in the DPI of the image?
Thanks!
image-processing ocr tesseract
image-processing ocr tesseract
asked Nov 19 '18 at 23:16
nkkollawnkkollaw
550719
550719
add a comment |
add a comment |
1 Answer
1
active
oldest
votes
According to documentation, the format for each line is
<symbol> <left> <bottom> <right> <top> <page>
Where:
<symbol>
is the character e.g. a or b.
<left> <bottom> <right> <top>
are the coordinates of the rectangle that fits the character on the page. Note that the coordinates system used by Tesseract has (0,0) in the bottom-left corner of the image!
<page>
is only relevant if you’re using multi-page TIFF files. In all other cases just put 0 in here.
So in your particular case
T 2768 165 2789 191 0
would be
- character:
T
- left:
2768
- bottom:
165
- right:
2789
- top:
191
- page:
0
Ha! Thanks for linking the documentation! I guess the <bottom> and <left> coordinates are wrong, then. It's just a character, so if that's pixels or points the numbers should be very similar. I'll look at the docs to see if I can find out more. Thanks!!
– nkkollaw
Nov 19 '18 at 23:23
add a comment |
Your Answer
StackExchange.ifUsing("editor", function () {
StackExchange.using("externalEditor", function () {
StackExchange.using("snippets", function () {
StackExchange.snippets.init();
});
});
}, "code-snippets");
StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "1"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});
function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});
}
});
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53384003%2fwhat-are-the-numbers-in-tesseract-box-file%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
1 Answer
1
active
oldest
votes
1 Answer
1
active
oldest
votes
active
oldest
votes
active
oldest
votes
According to documentation, the format for each line is
<symbol> <left> <bottom> <right> <top> <page>
Where:
<symbol>
is the character e.g. a or b.
<left> <bottom> <right> <top>
are the coordinates of the rectangle that fits the character on the page. Note that the coordinates system used by Tesseract has (0,0) in the bottom-left corner of the image!
<page>
is only relevant if you’re using multi-page TIFF files. In all other cases just put 0 in here.
So in your particular case
T 2768 165 2789 191 0
would be
- character:
T
- left:
2768
- bottom:
165
- right:
2789
- top:
191
- page:
0
Ha! Thanks for linking the documentation! I guess the <bottom> and <left> coordinates are wrong, then. It's just a character, so if that's pixels or points the numbers should be very similar. I'll look at the docs to see if I can find out more. Thanks!!
– nkkollaw
Nov 19 '18 at 23:23
add a comment |
According to documentation, the format for each line is
<symbol> <left> <bottom> <right> <top> <page>
Where:
<symbol>
is the character e.g. a or b.
<left> <bottom> <right> <top>
are the coordinates of the rectangle that fits the character on the page. Note that the coordinates system used by Tesseract has (0,0) in the bottom-left corner of the image!
<page>
is only relevant if you’re using multi-page TIFF files. In all other cases just put 0 in here.
So in your particular case
T 2768 165 2789 191 0
would be
- character:
T
- left:
2768
- bottom:
165
- right:
2789
- top:
191
- page:
0
Ha! Thanks for linking the documentation! I guess the <bottom> and <left> coordinates are wrong, then. It's just a character, so if that's pixels or points the numbers should be very similar. I'll look at the docs to see if I can find out more. Thanks!!
– nkkollaw
Nov 19 '18 at 23:23
add a comment |
According to documentation, the format for each line is
<symbol> <left> <bottom> <right> <top> <page>
Where:
<symbol>
is the character e.g. a or b.
<left> <bottom> <right> <top>
are the coordinates of the rectangle that fits the character on the page. Note that the coordinates system used by Tesseract has (0,0) in the bottom-left corner of the image!
<page>
is only relevant if you’re using multi-page TIFF files. In all other cases just put 0 in here.
So in your particular case
T 2768 165 2789 191 0
would be
- character:
T
- left:
2768
- bottom:
165
- right:
2789
- top:
191
- page:
0
According to documentation, the format for each line is
<symbol> <left> <bottom> <right> <top> <page>
Where:
<symbol>
is the character e.g. a or b.
<left> <bottom> <right> <top>
are the coordinates of the rectangle that fits the character on the page. Note that the coordinates system used by Tesseract has (0,0) in the bottom-left corner of the image!
<page>
is only relevant if you’re using multi-page TIFF files. In all other cases just put 0 in here.
So in your particular case
T 2768 165 2789 191 0
would be
- character:
T
- left:
2768
- bottom:
165
- right:
2789
- top:
191
- page:
0
answered Nov 19 '18 at 23:21
Michael JasperMichael Jasper
6,32712954
6,32712954
Ha! Thanks for linking the documentation! I guess the <bottom> and <left> coordinates are wrong, then. It's just a character, so if that's pixels or points the numbers should be very similar. I'll look at the docs to see if I can find out more. Thanks!!
– nkkollaw
Nov 19 '18 at 23:23
add a comment |
Ha! Thanks for linking the documentation! I guess the <bottom> and <left> coordinates are wrong, then. It's just a character, so if that's pixels or points the numbers should be very similar. I'll look at the docs to see if I can find out more. Thanks!!
– nkkollaw
Nov 19 '18 at 23:23
Ha! Thanks for linking the documentation! I guess the <bottom> and <left> coordinates are wrong, then. It's just a character, so if that's pixels or points the numbers should be very similar. I'll look at the docs to see if I can find out more. Thanks!!
– nkkollaw
Nov 19 '18 at 23:23
Ha! Thanks for linking the documentation! I guess the <bottom> and <left> coordinates are wrong, then. It's just a character, so if that's pixels or points the numbers should be very similar. I'll look at the docs to see if I can find out more. Thanks!!
– nkkollaw
Nov 19 '18 at 23:23
add a comment |
Thanks for contributing an answer to Stack Overflow!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53384003%2fwhat-are-the-numbers-in-tesseract-box-file%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown