How to count the characters of jar files by wc

Under the folder /usr/hdp/2.6.4.0-91/tez_hive2/lib/
We have .jar files as the following

$ ls  /usr/hdp/2.6.4.0-91/tez_hive2/lib/ | grep ".jar"

async-http-client-1.8.16.jar

azure-data-lake-store-sdk-2.1.4.jar

commons-cli-1.2.jar

commons-codec-1.4.jar

commons-collections-3.2.2.jar

commons-collections4-4.1.jar

commons-io-2.4.jar

commons-lang-2.6.jar

commons-math3-3.1.1.jar

guava-11.0.2.jar

hadoop-aws-2.7.3.2.6.4.0-91.jar

hadoop-azure-2.7.3.2.6.4.0-91.jar

hadoop-azure-datalake-2.7.3.2.6.4.0-91.jar

hadoop-mapreduce-client-common-2.7.3.2.6.4.0-91.jar

hadoop-mapreduce-client-core-2.7.3.2.6.4.0-91.jar

hadoop-yarn-server-timeline-pluginstorage-2.7.3.2.6.4.0-91.jar

jersey-client-1.9.jar

jersey-json-1.9.jar

jettison-1.3.4.jar

jetty-6.1.26.hwx.jar

jetty-util-6.1.26.hwx.jar

jsr305-3.0.0.jar

metrics-core-3.1.0.jar

protobuf-java-2.5.0.jar

RoaringBitmap-0.4.9.jar

servlet-api-2.5.jar

slf4j-api-1.7.10.jar

I want to count all characters from the .jar files by wc , in order to understand if .jar files renamed

So I do the following command in order to count all characters from all .jar files

ls  /usr/hdp/2.6.4.0-91/tez_hive2/lib/ | grep ".jar" | sed s'/// /g' | awk '{print $NF}' | wc | awk '{print $NF}'

758

So in this case we get 758 characters from all .jars

But the command isn’t elegant

How we can improve the command to be better?

edited Feb 28 at 19:57

jimmij

32.1k874108

asked Feb 28 at 19:03

yael

2,68722572

1

You want to get total string length of all the filename? That sounds like a strange request. Smells like an XY problem.

– glenn jackman
Feb 28 at 19:08

what I want it to count all characters from the output by wc or any other suggestion

– yael
Feb 28 at 19:17

1) Wouldn't it make sense to get the length of "each" filename? 2) That wouldn't really tell you a whole lot as it can be renamed to something with the same amount of letters. 3) There really isn't a way to tell if any of the files have been renamed unless a script runs something like ls -l | awk '{print $NF} > jarlog.txt so that you can compare the filenames.

– Nasir Riley
Feb 28 at 19:22

look only I want is to improve the cli - ls /usr/hdp/2.6.4.0-91/tez_hive2/lib/ | grep ".jar" | sed s'/// /g' | awk '{print $NF}' | wc | awk '{print $NF}'

– yael
Feb 28 at 19:24

1

printf '%sn' *.jar | md5sum wouldn't be blind to renaming to something with the same amount of letters.

– Kamil Maciorowski
Feb 28 at 19:32

|
show 5 more comments

Under the folder /usr/hdp/2.6.4.0-91/tez_hive2/lib/
We have .jar files as the following

$ ls  /usr/hdp/2.6.4.0-91/tez_hive2/lib/ | grep ".jar"

async-http-client-1.8.16.jar

azure-data-lake-store-sdk-2.1.4.jar

commons-cli-1.2.jar

commons-codec-1.4.jar

commons-collections-3.2.2.jar

commons-collections4-4.1.jar

commons-io-2.4.jar

commons-lang-2.6.jar

commons-math3-3.1.1.jar

guava-11.0.2.jar

hadoop-aws-2.7.3.2.6.4.0-91.jar

hadoop-azure-2.7.3.2.6.4.0-91.jar

hadoop-azure-datalake-2.7.3.2.6.4.0-91.jar

hadoop-mapreduce-client-common-2.7.3.2.6.4.0-91.jar

hadoop-mapreduce-client-core-2.7.3.2.6.4.0-91.jar

hadoop-yarn-server-timeline-pluginstorage-2.7.3.2.6.4.0-91.jar

jersey-client-1.9.jar

jersey-json-1.9.jar

jettison-1.3.4.jar

jetty-6.1.26.hwx.jar

jetty-util-6.1.26.hwx.jar

jsr305-3.0.0.jar

metrics-core-3.1.0.jar

protobuf-java-2.5.0.jar

RoaringBitmap-0.4.9.jar

servlet-api-2.5.jar

slf4j-api-1.7.10.jar

I want to count all characters from the .jar files by wc , in order to understand if .jar files renamed

So I do the following command in order to count all characters from all .jar files

ls  /usr/hdp/2.6.4.0-91/tez_hive2/lib/ | grep ".jar" | sed s'/// /g' | awk '{print $NF}' | wc | awk '{print $NF}'

758

So in this case we get 758 characters from all .jars

But the command isn’t elegant

How we can improve the command to be better?

edited Feb 28 at 19:57

jimmij

32.1k874108

asked Feb 28 at 19:03

yael

2,68722572

1

You want to get total string length of all the filename? That sounds like a strange request. Smells like an XY problem.

– glenn jackman
Feb 28 at 19:08

what I want it to count all characters from the output by wc or any other suggestion

– yael
Feb 28 at 19:17

1) Wouldn't it make sense to get the length of "each" filename? 2) That wouldn't really tell you a whole lot as it can be renamed to something with the same amount of letters. 3) There really isn't a way to tell if any of the files have been renamed unless a script runs something like ls -l | awk '{print $NF} > jarlog.txt so that you can compare the filenames.

– Nasir Riley
Feb 28 at 19:22

look only I want is to improve the cli - ls /usr/hdp/2.6.4.0-91/tez_hive2/lib/ | grep ".jar" | sed s'/// /g' | awk '{print $NF}' | wc | awk '{print $NF}'

– yael
Feb 28 at 19:24

1

printf '%sn' *.jar | md5sum wouldn't be blind to renaming to something with the same amount of letters.

– Kamil Maciorowski
Feb 28 at 19:32

|
show 5 more comments

Under the folder /usr/hdp/2.6.4.0-91/tez_hive2/lib/
We have .jar files as the following

$ ls  /usr/hdp/2.6.4.0-91/tez_hive2/lib/ | grep ".jar"

async-http-client-1.8.16.jar

azure-data-lake-store-sdk-2.1.4.jar

commons-cli-1.2.jar

commons-codec-1.4.jar

commons-collections-3.2.2.jar

commons-collections4-4.1.jar

commons-io-2.4.jar

commons-lang-2.6.jar

commons-math3-3.1.1.jar

guava-11.0.2.jar

hadoop-aws-2.7.3.2.6.4.0-91.jar

hadoop-azure-2.7.3.2.6.4.0-91.jar

hadoop-azure-datalake-2.7.3.2.6.4.0-91.jar

hadoop-mapreduce-client-common-2.7.3.2.6.4.0-91.jar

hadoop-mapreduce-client-core-2.7.3.2.6.4.0-91.jar

hadoop-yarn-server-timeline-pluginstorage-2.7.3.2.6.4.0-91.jar

jersey-client-1.9.jar

jersey-json-1.9.jar

jettison-1.3.4.jar

jetty-6.1.26.hwx.jar

jetty-util-6.1.26.hwx.jar

jsr305-3.0.0.jar

metrics-core-3.1.0.jar

protobuf-java-2.5.0.jar

RoaringBitmap-0.4.9.jar

servlet-api-2.5.jar

slf4j-api-1.7.10.jar

I want to count all characters from the .jar files by wc , in order to understand if .jar files renamed

So I do the following command in order to count all characters from all .jar files

ls  /usr/hdp/2.6.4.0-91/tez_hive2/lib/ | grep ".jar" | sed s'/// /g' | awk '{print $NF}' | wc | awk '{print $NF}'

758

So in this case we get 758 characters from all .jars

But the command isn’t elegant

How we can improve the command to be better?

edited Feb 28 at 19:57

jimmij

32.1k874108

asked Feb 28 at 19:03

yael

2,68722572

Under the folder /usr/hdp/2.6.4.0-91/tez_hive2/lib/
We have .jar files as the following

$ ls  /usr/hdp/2.6.4.0-91/tez_hive2/lib/ | grep ".jar"

async-http-client-1.8.16.jar

azure-data-lake-store-sdk-2.1.4.jar

commons-cli-1.2.jar

commons-codec-1.4.jar

commons-collections-3.2.2.jar

commons-collections4-4.1.jar

commons-io-2.4.jar

commons-lang-2.6.jar

commons-math3-3.1.1.jar

guava-11.0.2.jar

hadoop-aws-2.7.3.2.6.4.0-91.jar

hadoop-azure-2.7.3.2.6.4.0-91.jar

hadoop-azure-datalake-2.7.3.2.6.4.0-91.jar

hadoop-mapreduce-client-common-2.7.3.2.6.4.0-91.jar

hadoop-mapreduce-client-core-2.7.3.2.6.4.0-91.jar

hadoop-yarn-server-timeline-pluginstorage-2.7.3.2.6.4.0-91.jar

jersey-client-1.9.jar

jersey-json-1.9.jar

jettison-1.3.4.jar

jetty-6.1.26.hwx.jar

jetty-util-6.1.26.hwx.jar

jsr305-3.0.0.jar

metrics-core-3.1.0.jar

protobuf-java-2.5.0.jar

RoaringBitmap-0.4.9.jar

servlet-api-2.5.jar

slf4j-api-1.7.10.jar

I want to count all characters from the .jar files by wc , in order to understand if .jar files renamed

So I do the following command in order to count all characters from all .jar files

ls  /usr/hdp/2.6.4.0-91/tez_hive2/lib/ | grep ".jar" | sed s'/// /g' | awk '{print $NF}' | wc | awk '{print $NF}'

758

So in this case we get 758 characters from all .jars

But the command isn’t elegant

How we can improve the command to be better?

bash shell-script awk sed wc

edited Feb 28 at 19:57

jimmij

32.1k874108

asked Feb 28 at 19:03

yael

2,68722572

edited Feb 28 at 19:57

jimmij

32.1k874108

asked Feb 28 at 19:03

yael

2,68722572

edited Feb 28 at 19:57

jimmij

32.1k874108

edited Feb 28 at 19:57

jimmij

32.1k874108

edited Feb 28 at 19:57

jimmij

32.1k874108

asked Feb 28 at 19:03

yael

2,68722572

asked Feb 28 at 19:03

yael

2,68722572

asked Feb 28 at 19:03

yael

2,68722572

1

You want to get total string length of all the filename? That sounds like a strange request. Smells like an XY problem.

– glenn jackman
Feb 28 at 19:08

what I want it to count all characters from the output by wc or any other suggestion

– yael
Feb 28 at 19:17

1) Wouldn't it make sense to get the length of "each" filename? 2) That wouldn't really tell you a whole lot as it can be renamed to something with the same amount of letters. 3) There really isn't a way to tell if any of the files have been renamed unless a script runs something like ls -l | awk '{print $NF} > jarlog.txt so that you can compare the filenames.

– Nasir Riley
Feb 28 at 19:22

look only I want is to improve the cli - ls /usr/hdp/2.6.4.0-91/tez_hive2/lib/ | grep ".jar" | sed s'/// /g' | awk '{print $NF}' | wc | awk '{print $NF}'

– yael
Feb 28 at 19:24

1

printf '%sn' *.jar | md5sum wouldn't be blind to renaming to something with the same amount of letters.

– Kamil Maciorowski
Feb 28 at 19:32

|
show 5 more comments

1

You want to get total string length of all the filename? That sounds like a strange request. Smells like an XY problem.

– glenn jackman
Feb 28 at 19:08

what I want it to count all characters from the output by wc or any other suggestion

– yael
Feb 28 at 19:17

1) Wouldn't it make sense to get the length of "each" filename? 2) That wouldn't really tell you a whole lot as it can be renamed to something with the same amount of letters. 3) There really isn't a way to tell if any of the files have been renamed unless a script runs something like ls -l | awk '{print $NF} > jarlog.txt so that you can compare the filenames.

– Nasir Riley
Feb 28 at 19:22

look only I want is to improve the cli - ls /usr/hdp/2.6.4.0-91/tez_hive2/lib/ | grep ".jar" | sed s'/// /g' | awk '{print $NF}' | wc | awk '{print $NF}'

– yael
Feb 28 at 19:24

1

printf '%sn' *.jar | md5sum wouldn't be blind to renaming to something with the same amount of letters.

– Kamil Maciorowski
Feb 28 at 19:32

You want to get total string length of all the filename? That sounds like a strange request. Smells like an XY problem.

– glenn jackman
Feb 28 at 19:08

what I want it to count all characters from the output by wc or any other suggestion

– yael
Feb 28 at 19:17

1) Wouldn't it make sense to get the length of "each" filename? 2) That wouldn't really tell you a whole lot as it can be renamed to something with the same amount of letters. 3) There really isn't a way to tell if any of the files have been renamed unless a script runs something like ls -l | awk '{print $NF} > jarlog.txt so that you can compare the filenames.

– Nasir Riley
Feb 28 at 19:22

look only I want is to improve the cli - ls /usr/hdp/2.6.4.0-91/tez_hive2/lib/ | grep ".jar" | sed s'/// /g' | awk '{print $NF}' | wc | awk '{print $NF}'

– yael
Feb 28 at 19:24

printf '%sn' *.jar | md5sum wouldn't be blind to renaming to something with the same amount of letters.

– Kamil Maciorowski
Feb 28 at 19:32

|
show 5 more comments

4 Answers
4

active

oldest

votes

Most probably you are looking for

basename -a /usr/hdp/2.6.4.0-91/tez_hive2/lib/*.jar | wc -c

The path with wildcard list all jar files, the basename command strips directories (-a is needed to accept many arguments), and wc -c just counts bytes (if some filenames consist of 2 byte characters then perhaps wc -m (characters count) would be a better choice).

However, to if the goal is to check if files have been modified then perhaps stat (for modification time) or md5sum/shasum for checksum would be a better tools.

edited Feb 28 at 19:54

answered Feb 28 at 19:39

jimmij

32.1k874108

1

wc -c counts the number of bytes, not characters. wc -m counts the number of characters. Also note that you're also counting one extra newline character per file.

– Stéphane Chazelas
Feb 28 at 19:47

what is the diff between wc -c to wc -m ? ( because on both I get the same results )

– yael
Feb 28 at 19:51

2

@yael, wc -m counts the number of characters, wc -c the number of bytes. That makes a difference in the case of characters made of more than one byte (in UTF-8, that's all the non-ASCII ones (over a million of them)). Compare printf € | wc -c with printf € | wc -m.

– Stéphane Chazelas
Feb 28 at 19:53

add a comment |

Counting the number of letters in the filenames would not be a safe way of detecting a renamed filename.

Instead, create a simple file listing of the names, and compare it to an existing list. By using diff, you would be shown exactly which line(s) in the list had changed.

#!/bin/sh



LC_ALL=C



newlist=$HOME/filelist.new

oldlist=$HOME/filelist.old



echo /usr/hdp/2.6.4.0-91/tez_hive2/lib/*.jar >"$newlist"



if [ -f "$oldlist" ]; then

    diff -u "$oldlist" "$newlist"

fi



mv "$newlist" "$oldlist"

Obviously, the first time you do this, filelist.old would not exist, so the diff would not run.

Note that I save the full path to each file in the output file. This does not matter since the directory path is static.

Change echo to ls -l if you want to also compare timestamps etc. Change it to stat if you want to compare even more meta data (this would generate diff output when even the last-access timestamp on a file changed). Install wdiff and change diff to wdiff to get a word-based diff rather than a line-based one.

The LC_ALL=C is to guarantee a consistent sorting of the expansion of the shell glob.

edited Feb 28 at 20:47

answered Feb 28 at 19:56

Kusalananda

134k17255418

add a comment |

To just get the number of characters in the (non-hidden) jar filenames, I would do

cd /usr/hdp/2.6.4.0-91/tez_hive2/lib/ && printf %s *.jar | wc -m

This will not count any newlines, just the filename characters (replace wc -m with wc -c for the number of bytes instead of characters).

Purposefully, I'm not parsing ls output.

edited Feb 28 at 19:49

Stéphane Chazelas

309k57582942

answered Feb 28 at 19:46

glenn jackman

52.2k572112

Note that in shells like bash, if there's no .jar file in the current directory, that will output 5 (the number of characters in *.jar). In bash, you can do shopt -s nullglob to get 0 in that case.

– Stéphane Chazelas
Feb 28 at 19:51

add a comment |

I don't understand the point of the sed command, you are replacing a slash with a space? Why?

Aside from that, it seems that you want to count the total number of characters in all the file names of the .jar files. IF so, try this:
ls /usr/hdp/2.6.4.0-91/tez_hive2/lib/*.jar | sed s'/// /g' | wc -c

Two other possibilities for your sed command:
sed "s|/| |g" -- or -- tr '/' ' '

Since your ls command won't show directory names, I am not sure you need it.

This will also count the LF at the end of each file name. Is that OK?

answered Feb 28 at 19:38

Scottie H

346

add a comment |

Your Answer

StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "106"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});

function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: false,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: null,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});

}
});

draft saved

draft discarded

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2funix.stackexchange.com%2fquestions%2f503621%2fhow-to-count-the-characters-of-jar-files-by-wc%23new-answer', 'question_page');
}
);

Post as a guest

Name

Required, but never shown

4 Answers
4

active

oldest

votes

4 Answers
4

active

oldest

votes

Most probably you are looking for

basename -a /usr/hdp/2.6.4.0-91/tez_hive2/lib/*.jar | wc -c

However, to if the goal is to check if files have been modified then perhaps stat (for modification time) or md5sum/shasum for checksum would be a better tools.

edited Feb 28 at 19:54

answered Feb 28 at 19:39

jimmij

32.1k874108

1

wc -c counts the number of bytes, not characters. wc -m counts the number of characters. Also note that you're also counting one extra newline character per file.

– Stéphane Chazelas
Feb 28 at 19:47

what is the diff between wc -c to wc -m ? ( because on both I get the same results )

– yael
Feb 28 at 19:51

2

@yael, wc -m counts the number of characters, wc -c the number of bytes. That makes a difference in the case of characters made of more than one byte (in UTF-8, that's all the non-ASCII ones (over a million of them)). Compare printf € | wc -c with printf € | wc -m.

– Stéphane Chazelas
Feb 28 at 19:53

add a comment |

Most probably you are looking for

basename -a /usr/hdp/2.6.4.0-91/tez_hive2/lib/*.jar | wc -c

However, to if the goal is to check if files have been modified then perhaps stat (for modification time) or md5sum/shasum for checksum would be a better tools.

edited Feb 28 at 19:54

answered Feb 28 at 19:39

jimmij

32.1k874108

1

wc -c counts the number of bytes, not characters. wc -m counts the number of characters. Also note that you're also counting one extra newline character per file.

– Stéphane Chazelas
Feb 28 at 19:47

what is the diff between wc -c to wc -m ? ( because on both I get the same results )

– yael
Feb 28 at 19:51

2

@yael, wc -m counts the number of characters, wc -c the number of bytes. That makes a difference in the case of characters made of more than one byte (in UTF-8, that's all the non-ASCII ones (over a million of them)). Compare printf € | wc -c with printf € | wc -m.

– Stéphane Chazelas
Feb 28 at 19:53

add a comment |

Most probably you are looking for

basename -a /usr/hdp/2.6.4.0-91/tez_hive2/lib/*.jar | wc -c

However, to if the goal is to check if files have been modified then perhaps stat (for modification time) or md5sum/shasum for checksum would be a better tools.

edited Feb 28 at 19:54

answered Feb 28 at 19:39

jimmij

32.1k874108

Most probably you are looking for

basename -a /usr/hdp/2.6.4.0-91/tez_hive2/lib/*.jar | wc -c

However, to if the goal is to check if files have been modified then perhaps stat (for modification time) or md5sum/shasum for checksum would be a better tools.

edited Feb 28 at 19:54

answered Feb 28 at 19:39

jimmij

32.1k874108

edited Feb 28 at 19:54

answered Feb 28 at 19:39

jimmij

32.1k874108

answered Feb 28 at 19:39

jimmij

32.1k874108

answered Feb 28 at 19:39

jimmij

32.1k874108

1

wc -c counts the number of bytes, not characters. wc -m counts the number of characters. Also note that you're also counting one extra newline character per file.

– Stéphane Chazelas
Feb 28 at 19:47

what is the diff between wc -c to wc -m ? ( because on both I get the same results )

– yael
Feb 28 at 19:51

2

@yael, wc -m counts the number of characters, wc -c the number of bytes. That makes a difference in the case of characters made of more than one byte (in UTF-8, that's all the non-ASCII ones (over a million of them)). Compare printf € | wc -c with printf € | wc -m.

– Stéphane Chazelas
Feb 28 at 19:53

add a comment |

1

wc -c counts the number of bytes, not characters. wc -m counts the number of characters. Also note that you're also counting one extra newline character per file.

– Stéphane Chazelas
Feb 28 at 19:47

what is the diff between wc -c to wc -m ? ( because on both I get the same results )

– yael
Feb 28 at 19:51

2

@yael, wc -m counts the number of characters, wc -c the number of bytes. That makes a difference in the case of characters made of more than one byte (in UTF-8, that's all the non-ASCII ones (over a million of them)). Compare printf € | wc -c with printf € | wc -m.

– Stéphane Chazelas
Feb 28 at 19:53

wc -c counts the number of bytes, not characters. wc -m counts the number of characters. Also note that you're also counting one extra newline character per file.

– Stéphane Chazelas
Feb 28 at 19:47

what is the diff between wc -c to wc -m ? ( because on both I get the same results )

– yael
Feb 28 at 19:51

@yael, wc -m counts the number of characters, wc -c the number of bytes. That makes a difference in the case of characters made of more than one byte (in UTF-8, that's all the non-ASCII ones (over a million of them)). Compare printf € | wc -c with printf € | wc -m.

– Stéphane Chazelas
Feb 28 at 19:53

add a comment |

Counting the number of letters in the filenames would not be a safe way of detecting a renamed filename.

Instead, create a simple file listing of the names, and compare it to an existing list. By using diff, you would be shown exactly which line(s) in the list had changed.

#!/bin/sh



LC_ALL=C



newlist=$HOME/filelist.new

oldlist=$HOME/filelist.old



echo /usr/hdp/2.6.4.0-91/tez_hive2/lib/*.jar >"$newlist"



if [ -f "$oldlist" ]; then

    diff -u "$oldlist" "$newlist"

fi



mv "$newlist" "$oldlist"

Obviously, the first time you do this, filelist.old would not exist, so the diff would not run.

Note that I save the full path to each file in the output file. This does not matter since the directory path is static.

The LC_ALL=C is to guarantee a consistent sorting of the expansion of the shell glob.

edited Feb 28 at 20:47

answered Feb 28 at 19:56

Kusalananda

134k17255418

add a comment |

Counting the number of letters in the filenames would not be a safe way of detecting a renamed filename.

Instead, create a simple file listing of the names, and compare it to an existing list. By using diff, you would be shown exactly which line(s) in the list had changed.

#!/bin/sh



LC_ALL=C



newlist=$HOME/filelist.new

oldlist=$HOME/filelist.old



echo /usr/hdp/2.6.4.0-91/tez_hive2/lib/*.jar >"$newlist"



if [ -f "$oldlist" ]; then

    diff -u "$oldlist" "$newlist"

fi



mv "$newlist" "$oldlist"

Obviously, the first time you do this, filelist.old would not exist, so the diff would not run.

Note that I save the full path to each file in the output file. This does not matter since the directory path is static.

The LC_ALL=C is to guarantee a consistent sorting of the expansion of the shell glob.

edited Feb 28 at 20:47

answered Feb 28 at 19:56

Kusalananda

134k17255418

add a comment |

Counting the number of letters in the filenames would not be a safe way of detecting a renamed filename.

Instead, create a simple file listing of the names, and compare it to an existing list. By using diff, you would be shown exactly which line(s) in the list had changed.

#!/bin/sh



LC_ALL=C



newlist=$HOME/filelist.new

oldlist=$HOME/filelist.old



echo /usr/hdp/2.6.4.0-91/tez_hive2/lib/*.jar >"$newlist"



if [ -f "$oldlist" ]; then

    diff -u "$oldlist" "$newlist"

fi



mv "$newlist" "$oldlist"

Obviously, the first time you do this, filelist.old would not exist, so the diff would not run.

Note that I save the full path to each file in the output file. This does not matter since the directory path is static.

The LC_ALL=C is to guarantee a consistent sorting of the expansion of the shell glob.

edited Feb 28 at 20:47

answered Feb 28 at 19:56

Kusalananda

134k17255418

Counting the number of letters in the filenames would not be a safe way of detecting a renamed filename.

Instead, create a simple file listing of the names, and compare it to an existing list. By using diff, you would be shown exactly which line(s) in the list had changed.

#!/bin/sh



LC_ALL=C



newlist=$HOME/filelist.new

oldlist=$HOME/filelist.old



echo /usr/hdp/2.6.4.0-91/tez_hive2/lib/*.jar >"$newlist"



if [ -f "$oldlist" ]; then

    diff -u "$oldlist" "$newlist"

fi



mv "$newlist" "$oldlist"

Obviously, the first time you do this, filelist.old would not exist, so the diff would not run.

Note that I save the full path to each file in the output file. This does not matter since the directory path is static.

The LC_ALL=C is to guarantee a consistent sorting of the expansion of the shell glob.

edited Feb 28 at 20:47

answered Feb 28 at 19:56

Kusalananda

134k17255418

edited Feb 28 at 20:47

answered Feb 28 at 19:56

Kusalananda

134k17255418

answered Feb 28 at 19:56

Kusalananda

134k17255418

answered Feb 28 at 19:56

Kusalananda

134k17255418

add a comment |

To just get the number of characters in the (non-hidden) jar filenames, I would do

cd /usr/hdp/2.6.4.0-91/tez_hive2/lib/ && printf %s *.jar | wc -m

This will not count any newlines, just the filename characters (replace wc -m with wc -c for the number of bytes instead of characters).

Purposefully, I'm not parsing ls output.

edited Feb 28 at 19:49

Stéphane Chazelas

309k57582942

answered Feb 28 at 19:46

glenn jackman

52.2k572112

Note that in shells like bash, if there's no .jar file in the current directory, that will output 5 (the number of characters in *.jar). In bash, you can do shopt -s nullglob to get 0 in that case.

– Stéphane Chazelas
Feb 28 at 19:51

add a comment |

To just get the number of characters in the (non-hidden) jar filenames, I would do

cd /usr/hdp/2.6.4.0-91/tez_hive2/lib/ && printf %s *.jar | wc -m

This will not count any newlines, just the filename characters (replace wc -m with wc -c for the number of bytes instead of characters).

Purposefully, I'm not parsing ls output.

edited Feb 28 at 19:49

Stéphane Chazelas

309k57582942

answered Feb 28 at 19:46

glenn jackman

52.2k572112

Note that in shells like bash, if there's no .jar file in the current directory, that will output 5 (the number of characters in *.jar). In bash, you can do shopt -s nullglob to get 0 in that case.

– Stéphane Chazelas
Feb 28 at 19:51

add a comment |

To just get the number of characters in the (non-hidden) jar filenames, I would do

cd /usr/hdp/2.6.4.0-91/tez_hive2/lib/ && printf %s *.jar | wc -m

This will not count any newlines, just the filename characters (replace wc -m with wc -c for the number of bytes instead of characters).

Purposefully, I'm not parsing ls output.

edited Feb 28 at 19:49

Stéphane Chazelas

309k57582942

answered Feb 28 at 19:46

glenn jackman

52.2k572112

To just get the number of characters in the (non-hidden) jar filenames, I would do

cd /usr/hdp/2.6.4.0-91/tez_hive2/lib/ && printf %s *.jar | wc -m

This will not count any newlines, just the filename characters (replace wc -m with wc -c for the number of bytes instead of characters).

Purposefully, I'm not parsing ls output.

edited Feb 28 at 19:49

Stéphane Chazelas

309k57582942

answered Feb 28 at 19:46

glenn jackman

52.2k572112

edited Feb 28 at 19:49

Stéphane Chazelas

309k57582942

edited Feb 28 at 19:49

Stéphane Chazelas

309k57582942

edited Feb 28 at 19:49

Stéphane Chazelas

309k57582942

answered Feb 28 at 19:46

glenn jackman

52.2k572112

answered Feb 28 at 19:46

glenn jackman

52.2k572112

answered Feb 28 at 19:46

glenn jackman

52.2k572112

Note that in shells like bash, if there's no .jar file in the current directory, that will output 5 (the number of characters in *.jar). In bash, you can do shopt -s nullglob to get 0 in that case.

– Stéphane Chazelas
Feb 28 at 19:51

add a comment |

Note that in shells like bash, if there's no .jar file in the current directory, that will output 5 (the number of characters in *.jar). In bash, you can do shopt -s nullglob to get 0 in that case.

– Stéphane Chazelas
Feb 28 at 19:51

Note that in shells like bash, if there's no .jar file in the current directory, that will output 5 (the number of characters in *.jar). In bash, you can do shopt -s nullglob to get 0 in that case.

– Stéphane Chazelas
Feb 28 at 19:51

add a comment |

I don't understand the point of the sed command, you are replacing a slash with a space? Why?

answered Feb 28 at 19:38

Scottie H

346

add a comment |

I don't understand the point of the sed command, you are replacing a slash with a space? Why?

answered Feb 28 at 19:38

Scottie H

346

add a comment |

I don't understand the point of the sed command, you are replacing a slash with a space? Why?

answered Feb 28 at 19:38

Scottie H

346

I don't understand the point of the sed command, you are replacing a slash with a space? Why?

answered Feb 28 at 19:38

Scottie H

346

answered Feb 28 at 19:38

Scottie H

346

answered Feb 28 at 19:38

Scottie H

346

answered Feb 28 at 19:38

Scottie H

346

add a comment |

draft saved

draft discarded

Thanks for contributing an answer to Unix & Linux Stack Exchange!

Please be sure to answer the question. Provide details and share your research!

But avoid …

Asking for help, clarification, or responding to other answers.

Making statements based on opinion; back them up with references or personal experience.

To learn more, see our tips on writing great answers.

draft saved

draft discarded

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Post as a guest

Name

Required, but never shown

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Name

Required, but never shown

Name

Required, but never shown

This page is only for reference, If you need detailed information, please check here

搜尋此網誌

Cfrgtkky