Table of Contents

Parameter expansion

Introduction

One core functionality of Bash is to manage parameters. A parameter is an entity that stores values and is referenced by a name, a number or a special symbol.

Parameter expansion is the procedure to get the value from the referenced entity, like expanding a variable to print its value. On expansion time you can do very nasty things with the parameter or its value. These things are described here.

If you saw some parameter expansion syntax somewhere, and need to check what it can be, try the overview section below!

Arrays can be special cases for parameter expansion, every applicable description mentions arrays below. Please also see the article about arrays.

For a more technical view what a parameter is and which types exist, see the dictionary entry for "parameter".

Overview

Looking for a specific syntax you saw, without knowing the name?

Simple usage

$PARAMETER

${PARAMETER}

The easiest form is to just use a parameter's name within braces. This is identical to using $FOO like you see it everywhere, but has the advantage that it can be immediately followed by characters that would be interpreted as part of the parameter name otherwise. Compare these two expressions (WORD="car" for example), where we want to print a word with a trailing "s":

echo "The plural of $WORD is most likely $WORDs"
echo "The plural of $WORD is most likely ${WORD}s"

Why does the first one fail? It prints nothing, because a parameter (variable) named "WORDs" is undefined and thus printed as "" (nothing). Without using braces for parameter expansion, Bash will interpret the sequence of all valid characters from the introducing "$" up to the last valid character as name of the parameter. When using braces you just force Bash to only interpret the name inside your braces.

Also, please remember, that parameter names are (like nearly everything in UNIX®) case sensitive!

The second form with the curly braces is also needed to access positional parameters (arguments to a script) beyond $9:

echo "Argument  1 is: $1"
echo "Argument 10 is: ${10}"

Simple usage: Arrays

See also the article about general array syntax

For arrays you always need the braces. The arrays are expanded by individual indexes or mass arguments. An individual index behaves like a normal parameter, for the mass expansion, please read the article about arrays linked above.

Indirection

${!PARAMETER}

In some cases, like for example

${PARAMETER}

${PARAMETER:0:3}

you can instead use the form

${!PARAMETER}

to enter a level of indirection. The referenced parameter is not PARAMETER itself, but the parameter whose name is stored as the value of PARAMETER. If the parameter PARAMETER has the value "TEMP", then ${!PARAMETER} will expand to the value of the parameter named TEMP:

read -rep 'Which variable do you want to inspect? ' look_var

printf 'The value of "%s" is: "%s"\n' "$look_var" "${!look_var}" 

Of course the indirection also works with special variables:

# set some fake positional parameters
set one two three four

# get the LAST argument ("#" stores the number of arguments, so "!#" will reference the LAST argument)
echo ${!#}

You can think of this mechanism as being roughly equivalent to taking any parameter expansion that begins with the parameter name, and substituting the !PARAMETER part with the value of PARAMETER.

echo "${!var^^}"
# ...is equivalent to
eval 'echo "${'"$var"'^^}"'

It was an unfortunate design decision to use the ! prefix for indirection, as it introduces parsing ambiguity with other parameter expansions that begin with !. Indirection is not possible in combination with any parameter expansion whose modifier requires a prefix to the parameter name. Specifically, indirection isn't possible on the ${!var@}, ${!var*}, ${!var[@]}, ${!var[*]}, and ${#var} forms. This means the ! prefix can't be used to retrieve the indices of an array, the length of a string, or number of elements in an array indirectly (see indirection for workarounds). Additionally, the !-prefixed parameter expansion conflicts with ksh-like shells which have the more powerful "name-reference" form of indirection, where the exact same syntax is used to expand to the name of the variable being referenced.

Indirect references to array names are also possible since the Bash 3 series (exact version unknown), but undocumented. See indirection for details.

Chet has added an initial implementation of the ksh nameref declaration command to the git devel branch. (declare -n, local -n, etc, will be supported). This will finally address many issues around passing and returning complex datatypes to/from functions.

Case modification

${PARAMETER^}

${PARAMETER^^}

${PARAMETER,}

${PARAMETER,,}

${PARAMETER~}

${PARAMETER~~}

These expansion operators modify the case of the letters in the expanded text.

The ^ operator modifies the first character to uppercase, the , operator to lowercase. When using the double-form (^^ and ,,), all characters are converted.

The (currently undocumented) operators ~ and ~~ reverse the case of the given text (in PARAMETER).~ reverses the case of first letter of words in the variable while ~~ reverses case for all. Thanks to Bushmills and geirha on the Freenode IRC channel for this finding.

Example: Rename all *.txt filenames to lowercase

for file in *.txt; do
  mv "$file" "${file,,}"
done

Note: Case modification is a handy feature you can apply to a name or a title. Or is it? Case modification was an important aspect of the Bash 4 release. Bash version 4, RC1 would perform word splitting, and then case modification, resulting in title case (where every word is capitalized). It was decided to apply case modification to values, not words, for the Bash 4 release. Thanks Chet.

Case modification: Arrays

Case modification can be used to create the proper capitalization for names or titles. Just assign it to an array:

declare -a title=(my hello world john smith)

For array expansion, the case modification applies to every expanded element, no matter if you expand an individual index or mass-expand the whole array using @ or * subscripts. Some examples:

Assume: array=(This is some Text)

Variable name expansion

${!PREFIX*}

${!PREFIX@}

This expands to a list of all set variable names beginning with the string PREFIX. The elements of the list are separated by the first character in the IFS-variable (<space> by default).

This will show all defined variable names (not values!) beginning with "BASH":

$ echo ${!BASH*}
BASH BASH_ARGC BASH_ARGV BASH_COMMAND BASH_LINENO BASH_SOURCE BASH_SUBSHELL BASH_VERSINFO BASH_VERSION

This list will also include array names.

Substring removal

${PARAMETER#PATTERN}

${PARAMETER##PATTERN}

${PARAMETER%PATTERN}

${PARAMETER%%PATTERN}

This one can expand only a part of a parameter's value, given a pattern to describe what to remove from the string. The pattern is interpreted just like a pattern to describe a filename to match (globbing). See Pattern matching for more.

Example string (just a quote from a big man):

MYSTRING="Be liberal in what you accept, and conservative in what you send"

From the beginning

${PARAMETER#PATTERN} and ${PARAMETER##PATTERN}

This form is to remove the described pattern trying to match it from the beginning of the string. The operator "#" will try to remove the shortest text matching the pattern, while "##" tries to do it with the longest text matching. Look at the following examples to get the idea (matched text marked striked, remember it will be removed!):

SyntaxResult
${MYSTRING#*in}Be liberal in what you accept, and conservative in what you send
${MYSTRING##*in}Be liberal in what you accept, and conservative in what you send

From the end

${PARAMETER%PATTERN} and ${PARAMETER%%PATTERN}

In the second form everything will be the same, except that Bash now tries to match the pattern from the end of the string:

Syntax Result
${MYSTRING%in*} Be liberal in what you accept, and conservative in what you send
${MYSTRING%%in*} Be liberal in what you accept, and conservative in what you send

The second form nullifies variables that begin with in, by working from the end.

Common use

How the heck does that help to make my life easier?

Well, maybe the most common use for it is to extract parts of a filename. Just look at the following list with examples:

These are the syntaxes for filenames with a single extension. Depending on your needs, you might need to adjust shortest/longest match.

Substring removal: Arrays

As for most parameter expansion features, working on arrays will handle each expanded element, for individual expansion and also for mass expansion.

Simple example, removing a trailing is from all array elements (on expansion):

Assume: array=(This is a text)

All other variants of this expansion behave the same.

Search and replace

${PARAMETER/PATTERN/STRING}

${PARAMETER//PATTERN/STRING}

${PARAMETER/PATTERN}

${PARAMETER//PATTERN}

This one can substitute (replace) a substring matched by a pattern, on expansion time. The matched substring will be entirely removed and the given string will be inserted. Again some example string for the tests:

MYSTRING="Be liberal in what you accept, and conservative in what you send"

The two main forms only differ in the number of slashes after the parameter name: ${PARAMETER/PATTERN/STRING} and ${PARAMETER//PATTERN/STRING}

The first one (one slash) is to only substitute the first occurrence of the given pattern, the second one (two slashes) is to substitute all occurrences of the pattern.

First, let's try to say "happy" instead of "conservative" in our example string:

${MYSTRING//conservative/happy}
Be liberal in what you accept, and conservativehappy in what you send

Since there is only one "conservative" in that example, it really doesn't matter which of the two forms we use.

Let's play with the word "in", I don't know if it makes any sense, but let's substitute it with "by".

First form: Substitute first occurrence

${MYSTRING/in/by}
Be liberal inby what you accept, and conservative in what you send

Second form: Substitute all occurrences

${MYSTRING//in/by}
Be liberal inby what you accept, and conservative inby what you send

Anchoring Additionally you can "anchor" an expression: A # (hashmark) will indicate that your expression is matched against the beginning portion of the string, a % (percent-sign) will do it for the end portion.

MYSTRING=xxxxxxxxxx
echo ${MYSTRING/#x/y}  # RESULT: yxxxxxxxxx
echo ${MYSTRING/%x/y}  # RESULT: xxxxxxxxxy

If the replacement part is completely omitted, the matches are replaced by the nullstring, i.e., they are removed. This is equivalent to specifying an empty replacement:

echo ${MYSTRING//conservative/}
# is equivalent to
echo ${MYSTRING//conservative}

Search and replace: Arrays

This parameter expansion type applied to arrays applies to all expanded elements, no matter if an individual element is expanded, or all elements using the mass expansion syntaxes.

A simple example, changing the (lowercase) letter t to d:

Assume: array=(This is a text)

String length

${#PARAMETER}

When you use this form, the length of the parameter's value is expanded. Again, a quote from a big man, to have a test text:

MYSTRING="Be liberal in what you accept, and conservative in what you send"

Using echo ${#MYSTRING}

64

The length is reported in characters, not in bytes. Depending on your environment this may not always be the same (multibyte-characters, like in UTF8 encoding).

There's not much to say about it, mh?

(String) length: Arrays

For arrays, this expansion type has two meanings:

Example:

Assume: array=(This is a text)

Attention: The number of used elements does not need to conform to the highest index. Sparse arrays are possible in Bash, that means you can have 4 elements, but with indexes 1, 7, 20, 31. You can't loop through such an array with a counter loop based on the number of elements!

Substring expansion

${PARAMETER:OFFSET}

${PARAMETER:OFFSET:LENGTH}

This one can expand only a part of a parameter's value, given a position to start and maybe a length. If LENGTH is omitted, the parameter will be expanded up to the end of the string. If LENGTH is negative, it's taken as a second offset into the string, counting from the end of the string.

OFFSET and LENGTH can be any arithmetic expression. Take care: The OFFSET starts at 0, not at 1!

Example string (a quote from a big man): MYSTRING="Be liberal in what you accept, and conservative in what you send"

Using only Offset

In the first form, the expansion is used without a length value, note that the offset 0 is the first character:

echo ${MYSTRING:35}
Be liberal in what you accept, and conservative in what you send

Using Offset and Length

In the second form we also give a length value:

echo ${MYSTRING:35:12}
Be liberal in what you accept, and conservative in what you send

Negative Offset Value

If the given offset is negative, it's counted from the end of the string, i.e. an offset of -1 is the last character. In that case, the length still counts forward, of course. One special thing is to do when using a negative offset: You need to separate the (negative) number from the colon:

${MYSTRING: -10:5}
${MYSTRING:(-10):5}
Why? Because it's interpreted as the parameter expansion syntax to use a default value.

Negative Length Value

If the LENGTH value is negative, it's used as offset from the end of the string. The expansion happens from the first to the second offset then:

echo "${MYSTRING:11:-17}"
Be liberal in what you accept, and conservative in what you send

This works since Bash 4.2-alpha, see also Bash changes.

Substring/Element expansion: Arrays

For arrays, this expansion type has again 2 meanings:

Example:

Assume: array=(This is a text)

Use a default value

${PARAMETER:-WORD}

${PARAMETER-WORD}

If the parameter PARAMETER is unset (never was defined) or null (empty), this one expands to WORD, otherwise it expands to the value of PARAMETER, as if it just was ${PARAMETER}. If you omit the : (colon), like shown in the second form, the default value is only used when the parameter was unset, not when it was empty.

echo "Your home directory is: ${HOME:-/home/$USER}."
echo "${HOME:-/home/$USER} will be used to store your personal data."

If HOME is unset or empty, everytime you want to print something useful, you need to put that parameter syntax in.

#!/bin/bash

read -p "Enter your gender (just press ENTER to not tell us): " GENDER
echo "Your gender is ${GENDER:-a secret}."

It will print "Your gender is a secret." when you don't enter the gender. Note that the default value is used on expansion time, it is not assigned to the parameter.

Use a default value: Arrays

For arrays, the behaviour is very similar. Again, you have to make a difference between expanding an individual element by a given index and mass-expanding the array using the @ and * subscripts.

In other words: The basic meaning of this expansion type is applied as consistent as possible to arrays.

Example code (please try the example cases yourself):

####
# Example cases for unset/empty arrays and nullstring elements
####


### CASE 1: Unset array (no array)

# make sure we have no array at all
unset array

echo ${array[@]:-This array is NULL or unset}
echo ${array[@]-This array is NULL or unset}

### CASE 2: Set but empty array (no elements)

# declare an empty array
array=()

echo ${array[@]:-This array is NULL or unset}
echo ${array[@]-This array is NULL or unset}


### CASE 3: An array with only one element, a nullstring
array=("")

echo ${array[@]:-This array is NULL or unset}
echo ${array[@]-This array is NULL or unset}


### CASE 4: An array with only two elements, a nullstring and a normal word
array=("" word)

echo ${array[@]:-This array is NULL or unset}
echo ${array[@]-This array is NULL or unset}

Assign a default value

${PARAMETER:=WORD}

${PARAMETER=WORD}

This one works like the using default values, but the default text you give is not only expanded, but also assigned to the parameter, if it was unset or null. Equivalent to using a default value, when you omit the : (colon), as shown in the second form, the default value will only be assigned when the parameter was unset.

echo "Your home directory is: ${HOME:=/home/$USER}."
echo "$HOME will be used to store your personal data."

After the first expansion here (${HOME:=/home/$USER}), HOME is set and usable.

Let's change our code example from above:

#!/bin/bash

read -p "Enter your gender (just press ENTER to not tell us): " GENDER
echo "Your gender is ${GENDER:=a secret}."
echo "Ah, in case you forgot, your gender is really: $GENDER"

Assign a default value: Arrays

For arrays this expansion type is limited. For an individual index, it behaves like for a "normal" parameter, the default value is assigned to this one element. The mass-expansion subscripts @ and * can not be used here because it's not possible to assign to them!

Use an alternate value

${PARAMETER:+WORD}

${PARAMETER+WORD}

This form expands to nothing if the parameter is unset or empty. If it is set, it does not expand to the parameter's value, but to some text you can specify:

echo "The Java application was installed and can be started.${JAVAPATH:+ NOTE: JAVAPATH seems to be set}"
The above code will simply add a warning if JAVAPATH is set (because it could influence the startup behaviour of that imaginary application).

Some more unrealistic example… Ask for some flags (for whatever reason), and then, if they were set, print a warning and also print the flags:

#!/bin/bash

read -p "If you want to use special flags, enter them now: " SPECIAL_FLAGS
echo "The installation of the application is finished${SPECIAL_FLAGS:+ (NOTE: there are special flags set: $SPECIAL_FLAGS)}."

If you omit the colon, as shown in the second form (${PARAMETER+WORD}), the alternate value will be used if the parameter is set (and it can be empty)! You can use it, for example, to complain if variables you need (and that can be empty) are undefined:

# test that with the three stages:

# unset foo
# foo=""
# foo="something"

if [[ ${foo+isset} = isset ]]; then
  echo "foo is set..."
else
  echo "foo is not set..."
fi

Use an alternate value: Arrays

Similar to the cases for arrays to expand to a default value, this expansion behaves like for a "normal" parameter when using individual array elements by index, but reacts differently when using the mass-expansion subscripts @ and *:

For some cases to play with, please see the code examples in the description for using a default value.

Display error if null or unset

${PARAMETER:?WORD}

${PARAMETER?WORD}

If the parameter PARAMETER is set/non-null, this form will simply expand it. Otherwise, the expansion of WORD will be used as appendix for an error message:

$ echo "The unset parameter is: ${p_unset?not set}"
bash: p_unset: not set

After printing this message,

The meaning of the colon (:) is the same as for the other parameter expansion syntaxes: It specifies if

are taken into account.

Code examples

Substring removal

Removing the first 6 characters from a text string:

STRING="Hello world"

# only print 'Hello'
echo "${STRING%??????}"

# only print 'world'
echo "${STRING#??????}"

# store it into the same variable
STRING=${STRING#??????}

Bugs and Portability considerations

Quote Nesting

# Bash
 $ typeset -a a=(meh bleh blerg) b
 $ IFS=e
 $ printf "<%s> " "${b[@]-"${a[@]}" "${a[@]}"}"; echo # The entire PE is quoted so Bash considers the inner quotes redundant.
<meh> <bleh> <blerg meh> <bleh> <blerg>
 $ printf "<%s> " "${b[@]-${a[@]} ${a[@]}}"; echo # The outer quotes cause the inner expansions to be considered quoted.
<meh> <bleh> <blerg meh> <bleh> <blerg>
 $ b=(meep beep)
 $ printf "<%s> " "${b[@]-"${a[@]}" "${a[@]}"}" "${b[@]-${a[@]} ${a[@]}}"; echo # Again no surprises. Outer quotes quote everything recursively.
<meep> <beep> <meep> <beep>

Now lets see what can happen if we leave the outside unquoted.

# Bash
 $ typeset -a a=(meh bleh blerg) b
 $ IFS=e
 $ printf "<%s> " ${b[@]-"${a[@]}" "${a[@]}"}; echo # Inner quotes make inner expansions quoted.
<meh> <bleh> <blerg meh> <bleh> <blerg>
 $ printf "<%s> " ${b[@]-${a[@]} ${a[@]}}; echo' # No quotes at all wordsplits / globs, like you'd expect.
<m> <h> <bl> <h> <bl> <rg m> <h> <bl> <h> <bl> <rg>

This all might be intuitive, and is the most common implementation, but this design sucks for a number of reasons. For one, it means Bash makes it absolutely impossible to expand any part of the inner region unquoted while leaving the outer region quoted. Quoting the outer forces quoting of the inner regions recursively (except nested command substitutions of course). Word-splitting is necessary to split words of the inner region, which cannot be done together with outer quoting. Consider the following (only slightly far-fetched) code:

# Bash (non-working example)

unset -v IFS # make sure we have a default IFS

if some crap; then
    typeset -a someCmd=(myCmd arg1 'arg2 yay!' 'third*arg*' 4)
fi

someOtherCmd=mycommand
typeset -a otherArgs=(arg3 arg4)

# What do you think the programmer expected to happen here?
# What do you think will actually happen...

"${someCmd[@]-"$someOtherCmd" arg2 "${otherArgs[@]}"}" arg5

This final line is perhaps not the most obvious, but I've run into cases were this type of logic can be desirable and realistic. We can deduce what was intended:

Unfortunately, it is impossible to get the intended result in Bash (and most other shells) without taking a considerably different approach. The only way to split the literal inner parts is through word-splitting, which requires that the PE be unquoted. But, the only way to expand the outer expansion correctly without word-splitting or globbing is to quote it. Bash will actually expand the command as one of these:

# The quoted PE produces a correct result here...
 $ bash -c 'typeset -a someCmd=(myCmd arg1 "arg2 yay!" "third*arg*" 4); printf "<%s> " "${someCmd[@]-"$someOtherCmd" arg2 "${otherArgs[@]}"}" arg5; echo'
<myCmd> <arg1> <arg2 yay!> <third*arg*> <4> <arg5>

# ...but in the opposite case the first 3 arguments are glued together. There are no workarounds.
 $ bash -c 'typeset -a otherArgs=(arg3 arg4); someOtherCmd=mycommand; printf "<%s> " "${someCmd[@]-"$someOtherCmd" arg2 "${otherArgs[@]}"}" arg5; echo'
<mycommand arg2 arg3> <arg4> <arg5>

# UNLESS! we unquote the outer expansion allowing the inner quotes to
# affect the necessary parts while allowing word-splitting to split the literals:
 $ bash -c 'typeset -a otherArgs=(arg3 arg4); someOtherCmd=mycommand; printf "<%s> " ${someCmd[@]-"$someOtherCmd" arg2 "${otherArgs[@]}"} arg5; echo'
<mycommand> <arg2> <arg3> <arg4> <arg5>

# Success!!!
 $ bash -c 'typeset -a someCmd=(myCmd arg1 "arg2 yay!" "third*arg*" 4); printf "<%s> " ${someCmd[@]-"$someOtherCmd" arg2 "${otherArgs[@]}"} arg5; echo'
<myCmd> <arg1> <arg2> <yay!> <third*arg*> <4> <arg5>

# ...Ah f^^k. (again, no workaround possible.)

The ksh93 exception

To the best of my knowledge, ksh93 is the only shell that acts differently. Rather than forcing nested expansions into quoting, a quote at the beginning and end of the nested region will cause the quote state to reverse itself within the nested part. I have no idea whether it's an intentional or documented effect, but it does solve the problem and consequently adds a lot of potential power to these expansions.

All we need to do is add two extra double-quotes:

# ksh93 passing the two failed tests from above:

 $ ksh -c 'otherArgs=(arg3 arg4); someOtherCmd="mycommand"; printf "<%s> " "${someCmd[@]-""$someOtherCmd" arg2 "${otherArgs[@]}""}" arg5; echo'
<mycommand> <arg2> <arg3> <arg4> <arg5>

 $ ksh -c 'typeset -a someCmd=(myCmd arg1 "arg2 yay!" "third*arg*" 4); printf "<%s> " "${someCmd[@]-""$someOtherCmd" arg2 "${otherArgs[@]}""}" arg5; echo'
<myCmd> <arg1> <arg2 yay!> <third*arg*> <4> <arg5>

This can be used to control the quote state of any part of any expansion to an arbitrary depth. Sadly, it is the only shell that does this and the difference may introduce a possible compatibility problem.

See also