TurnKey Linux Virtual Appliance Library

Two simple tricks for better shell script error handling

Psssst. Hey you... yeah you. Word on the street is your shell scripts don't do any error handling. They just chug happily along even when everything is broken.

Because a lowly shell shell script doesn't need any error handling right? WRONG!

Here are two simple tricks that are easy to use and will make your scripts much more robust.

  1. Turn on -e mode (do you feel lucky - punk?)

    In this mode any command your script runs which returns a non-zero exitcode - an error in the world of shell - will cause your script to itself terminate immediately with an error.

    You can do that in your shebang line:

    #!/bin/sh -e
    

    Or using set:

    set -e
    

    Yes, this is what you want. A neat predictable failure is infinitely better than a noisy unreliable failure.

    If you REALLY want to ignore an error, be explicit about it:

    # I don't care if evil-broken-command fails
    evil-broken-command || true
    

    Oh and as long as you're messing with shell modes, -e goes well with -x (which I like to think of as shell X-ray).

    Like this:

    #!/bin/sh -ex
    

    Or like this:

    # turn -x on if DEBUG is set to a non-empty string
    [ -n "$DEBUG" ] && set -x
    

    That way you can actually see what your script was doing right before it failed.

  2. Use trap for robust clean-ups

    A trap is a snippet of code that the shell executes when it exits or receives a signal. For example, pressing CTRL-C in the terminal where the script is running generates the INT signal. killing the process by default generates a TERM (I.e., terminate) signal.

    I find traps most useful for making sure my scripts clean-up after themselves whatever happens (e.g., a non-zero error code in -e mode).

    For example:

    
    #!/bin/sh -e
    
    TMPFILE=$(tempfile)
    trap 'echo "removing $TMPFILE"; rm -f $TMPFILE' INT TERM EXIT
    
    echo TMPFILE=$TMPFILE
    echo hello world > $TMPFILE
    cat $TMPFILE
    
    # gives user a chance to press CTRL-C
    sleep 3
    
    # false always returns an error
    false
    
    echo "NEVER REACHED"
    

    Note that you can only set one trap per signal. If you set a new trap you're implicitly disabling the old one. You can also disable a trap by specifying - as the argument, like this:

    trap - INT TERM EXIT
    
Enjoyed this post? Get future posts delivered by email or get the RSS feed.
Find us on Facebook and Twitter.
Last update: Tue, May 08 - 20:31

Comments

If you're going to trap signals, make sure you propagate them

 If you decide to trap INT or TERM, it would be wise to properly kill your process with INT or TERM:

# if you're using bash, you can use $BASHPID in place of $$
for sig in INT TERM EXIT; do
    trap "rm -f \"\$TMPFILE\"; [[ $sig == EXIT ]] || kill -$sig $$" $sig
done

Not propagating signals in this manner is being a bad Unix citizen. Bash would have re-raised the SIGNAL, so you should too.

This guy has it figured out: http://www.cons.org/cracauer/sigint.html

Liraz Siri's picture

Good point!

Thanks for the feedback. I haven't run into any problems (yet) obstructing signal propagation in my scripts but as you point out it's not The Correct Thing To Do. For the sake of correctness I'll be updating my error handling code. I find doing the correct thing often saves me from debugging strange edge cases.

what about exit command

Can't you just use "&& exit 1" in your cleanup instead of an explicit kill? Or is there a reason you should kill the PID instead?

Great post!

I think most people are guily of this. I know I have been but I am forcing myself to do it now. I will definitely be using this! Thanks!

Thanks to Guns as well for his comment above. I've got some reading to do.

Very informational. Thanks a lot!

Very informational. Thanks a lot!

There is more than one shell

Two simple tricks for better Bash script error handling - ftfy
Liraz Siri's picture

Updated the shebang line

If memory serves traps and option modes should be supported by any POSIX shell, not just bash. To reflect that I updated the shebang to be a bit more generic.

no, there's not

/bin/sh is The Shell. Programming in anything else is, at best, unreliably portable. and, linuxisms aside, /bin/sh is not necessarily bash. If you think there's something you can do in some other shell that can't be done in /bin/sh, you probably don't know /bin/sh well enough (or have found an interesting edge case).

 

(now get off my lawn)

I've been using trap in Korn

I've been using trap in Korn shell for years, works fine :-P

-e is not always a good idea

For simple scripts, -e may make life a little simpler, but be careful.

For instance, tar exits with 1 if a file changes during an archive create (-c), and it'll exit with 2 if a more serious error occurs.  So, if you use the -e approach, you'll have to do:

#!/bin/sh -e
...
set +e
tar cf /tmp/foo blah blah blah
if [ $? -eq 2 ]; then
    handle_error
else
    # we're ok
    :
fi
set -e
...

Actually, you can omit the

Actually, you can omit the "set +e" and do:

 

rc = 0
tar cf /tmp/foo blah blah blah || rc=$?
if [ $rc -eq 2 ] ; then 
...etc...

and it'll work just fine.

Very Helpful

I'm just getting started; I'm glad I caught this thread before I got too far along. Rik

set -u # "nounset" also useful

-

shell script for CPU alerts

I have wrote the below script but it asking password for each server. what I will do so that it ask passwod one as it is asking more user name

 

#!/bin/bash
#this is a script for CPU utlization
echo "Enter the list of servers and then press Ctrl D"
cat >serverlist
echo "Enter the Login name"
read login
#echo  "Please enter the password for the entered username:"
#stty -echo
#read pass
#echo "Re-enter Password:"
#stty -echo
#read passwd
#stty echo
#if [ $pass != $passwd ];then
#echo "Password doesn't match, please rerun the script"
#exit 1
#fi
cat <<EOF >msg
Hi,
 
The CPU Usage of the server mentioned in the subject line is very high, Please investigate the below report on CPU Usage.
 
                      *******************************
                             CPU USAGE REPORT
                      *******************************
 
 
EOF
 
for i in `cat serverlist`
do
ssh $login@$i "/usr/local/bin/top -b -n 1 2>/dev/null ; /usr/bin/top -b -n 1 2>/dev/null; sar -r 2 4; sar 2 4" >output
cat output
echo "Do you Want to escalate : Press y/n?"
read ch
if [ "$ch" = "y" ]
then
echo "Please Enter the Ticket number for this server"
read ticket
echo "Enter the APP Owner's email id"
read id
cat msg  output | mail -s "CPU utilization is HIGH for server $i ; Ticket $ticket" -c patanga.subhendu@gmail.com $id
else
echo "Moving to the next host now"
fi
done

Post new comment

The content of this field is kept private and will not be shown publicly. If you have a Gravatar account, used to display your avatar.
  • Web page addresses and e-mail addresses turn into links automatically.
  • Allowed HTML tags: <a> <p> <span> <div> <h1> <h2> <h3> <h4> <h5> <h6> <img> <map> <area> <hr> <br> <br /> <ul> <ol> <li> <dl> <dt> <dd> <table> <tr> <td> <em> <b> <u> <i> <strong> <font> <del> <ins> <sub> <sup> <quote> <blockquote> <pre> <address> <code> <cite> <strike> <caption>

More information about formatting options

Leave this field empty. It's part of a security mechanism.
(Dear spammers: moderators are notified of all new posts. Spam is deleted immediately)