Built-in method to escape shell arguments

ThisIsMissEm commented 4 years ago

Is your feature request related to a problem? Please describe.

Yes, I was using execa which is a light wrapper for childProcess.spawn to execute a script (call it ./scripts/configure) which took user input as an argument. One of my users supplied "Users & Permissions Management" as that input, which caused the script to hang as the resulting spawned process was:

./scripts/configure Users & Permissions Management

I realised as soon as the bug was reported that I should've escaped the string passed into my function that called execa, so then I looked for modules to correctly escape shell arguments, and they seem pretty complex. Which leads to the question: do I really want to depend on a third-party module to correctly escape shell arguments? Am I just trading one security risk for another?

Describe the solution you'd like

Have a method like childProcess.escapeArgument(arg: string): string which correctly escapes the given value such that it is just a string for all terminals (cross-platform).

Clarification: I am not arguing for childProcess.spawn to escape arguments into strings by default, as that'd be a breaking change, even though it would likely be for the best (if you wanna pass multiple arguments, use the array syntax, not a string). Instead, I'm just asking for a method built-in that's well tested to escape an argument into a string argument for a shell command.

Describe alternatives you've considered

Various NPM modules, writing it myself, etc. All just shift the security responsibility to arguably worse places. This seems like due to the security benefits it can give, it'd be a good candidate for being a built-in function, ideally backported to LTS's

ThisIsMissEm commented 4 years ago

There are also some really bad suggestions on StackOverflow about how to do this: https://stackoverflow.com/questions/1779858/how-do-i-escape-a-string-for-a-shell-command-in-node

(my favourite there is "covert it to base64, then pipe base64 -d then pipe to your command")

ThisIsMissEm commented 4 years ago

More background context: https://twitter.com/thisismissem/status/1296071082504531970?s=21

devsnek commented 4 years ago

I think, overall, you just shouldn't be trying to pass user input into shell commands. if you use child_process.spawn directly you can pass arguments as an array without worrying about escaping but you're still passing arbitrary input into exec.

ThisIsMissEm commented 4 years ago

the problem is even with child_process.spawn you'll still end up with incorrect command execution.

const value = "Users & Permissions Management";
child_process.spawn('./scripts/configure', [value])

Will still result in ./scripts/configure Users & Permissions Management, and you do sometimes need to be able to accept user input and safely pass it to another command.

devsnek commented 4 years ago

@ThisIsMissEm that would only be a problem if ./scripts/configure is a shell script. every shell has different rules for how arguments are parsed, so i'm not sure there's a productive path forward to us solving that in node.

ThisIsMissEm commented 4 years ago

Existing Modules:

Each of these produces different output. In ruby, the shellwords.escape method, which is commonly recommended also produces different output to all of those, iirc.

ThisIsMissEm commented 4 years ago

In my specific case, it's ('yarn', ['configure', '--set-something', value]) and the script under yarn configure actually also executes a child process, so it's at least two child process executions away from the arguments parser

But if I could easily escape value such that is was a string and always a single string, then the value would pass through fine.

ThisIsMissEm commented 4 years ago

@devsnek this is purely about providing a way to escape a string for shell usage — I would imagine majority are similar, maybe some differences between windows and unix/posix compliant shells, but that'd be it, I'd imagine.

devsnek commented 4 years ago

userland is the appropriate place for this imo.

bnoordhuis commented 4 years ago

Just so we're on the same page, the algorithm for escaping one shell argument is this:

function quote(s) {
  if (s === '') return `''`;
  if (!/[^%+,-.\/:=@_0-9A-Za-z]/.test(s)) return s;
  return `'` + s.replace(/'/g, `'"'`) + `'`;
}

The two if guards are not strictly necessary but they make for nicer output. This one-liner is also valid:

const quote = (s) => `'` + s.replace(/'/g, `'"'`) + `'`;

I'm a bit on the fence as to whether something that simple belongs in core but I wouldn't strenuously object.

silverwind commented 4 years ago

Another alternative is to replace with escaped single quote. Not sure if it'd make any practical difference.

s.replace(/'/gm, `'\\''`)

bnoordhuis commented 4 years ago

https://www.gnu.org/savannah-checkouts/gnu/bash/manual/bash.html#Single-Quotes

Enclosing characters in single quotes (‘'’) preserves the literal value of each character within the quotes. A single quote may not occur between single quotes, even when preceded by a backslash.

silverwind commented 4 years ago

Of course the output would be wrapped in single quotes so it's not actually putting single quotes inside single quotes. I consider it more secure because it keep variables inside single quotes inside such while the "'" method enables variable expansion (which may expose secret environment variables to the user):

$ bash -c 'echo "'"$EDITOR"'"'
nvim
$ bash -c 'echo '\''$EDITOR'\'''
$EDITOR

I guess this difference alone means there can not be one single escaping method because variable expansion may or may not be desired depending on use case.

bnoordhuis commented 4 years ago

while the "'" method enables variable expansion

Right, but note that the sample I posted uses '"' - quote, double qote, quote.

silverwind commented 4 years ago

Indeed, my mistake. The end result is still the same with variable expansion being possible.


$ bash -c 'echo '"'$EDITOR'"''
nvim

bnoordhuis commented 4 years ago

Still not apples to apples. :-)

The argument to bash -c in your example is evaluated by the outer shell. That gotcha doesn't apply to child_process.spawn() or child_process.exec().

Just so we're on the same page, the thing quote() protects against is unsanitized input like this:

let input = `1337'; cat '/etc/passwd`
let banner = child_process.execSync(`/usr/bin/banner '${input}'`)
socket.write(banner)

(Excerpted from my banner-as-a-service SaaS. I'm taking VC funding.)

silverwind commented 4 years ago

Interesting. I thought exec('cmd') was literally equivalent to sh -c 'cmd' in the terminal but I see we're safe from those expansions because there's no outer shell involved. Anyways, I think my method is still slightly superior because it does can not fail with unbalanced quotes:

> String(child_process.execSync(`echo '${`1337'; cat '/etc/passwd`.replace(/'/gm, `'\\''`)}'`))
"1337'; cat '/etc/passwd\n"
> String(child_process.execSync(`echo '${`1337"; cat "/etc/passwd`.replace(/'/gm, `'\\''`)}'`))
'1337"; cat "/etc/passwd\n'
> String(child_process.execSync(`echo '${`1337"; cat '/etc/passwd`.replace(/'/gm, `'\\''`)}'`))
`1337"; cat '/etc/passwd\n`

> String(child_process.execSync(`echo '${`1337'; cat '/etc/passwd`.replace(/'/gm, `'"'`)}'`))
"1337'; cat '/etc/passwd\n"
> String(child_process.execSync(`echo '${`1337"; cat "/etc/passwd`.replace(/'/gm, `'"'`)}'`))
'1337"; cat "/etc/passwd\n'
> String(child_process.execSync(`echo '${`1337"; cat '/etc/passwd`.replace(/'/gm, `'"'`)}'`))
/bin/sh: -c: line 0: unexpected EOF while looking for matching `"'
/bin/sh: -c: line 1: syntax error: unexpected end of file

devsnek commented 4 years ago

I'd still say the ultimate method is to avoid passing random strings to shell scripts via arguments. In the example the OP brought up with yarn, they could use the yarn node api instead of invoking yarn with child_process.

ThisIsMissEm commented 4 years ago

At some point user input will need to travel from node.js to a separate binary, using a javascript API for a thing isn't always an answer (look at anything that does transcoding, as that tends to be a mess of ffmpeg arguments, and using a binary API is often less desirable here).

Whilst there are security issues around passing arbitrary data, as we can see from this issue, there are already multiple suggested ways of escaping data, which makes it much harder for the average developer to know how to do this correctly, ergo, node.js core should provide a recommended way for such a common vector for security attacks.

devsnek commented 4 years ago

ffmpeg isn't a shell, it won't reparse the arguments into something that can spawn new programs. this is only a problem if you're passing arguments to a shell.

ChALkeR commented 3 years ago

The recommended solution is to use execFile, or any other child_process method that accepts structured input (executable name/path and and array of arguments), and not build the shell command with string concatenation at all.

tniessen commented 3 years ago

@ThisIsMissEm What I tend to do is, create a wrapper shell script, and pass all values as named environment variables from Node.js to the shell script. The shell script can then pass those as options and positional arguments to any other commands. This has been more reliable than trying to build a command line with proper escaping.

rtweeks commented 2 years ago

If childProcess.exec is going to invoke some arbitrary shell, then childProcess ought to provide a function/method for escaping arguments appropriate to that shell. I don't see much discussion of this here, but in process.platform === 'win32' land, shell rules are different than bash. The code offered above definitely does not seem to account for that.

kwasimensah commented 2 years ago

Stumbling across this. The use case that's also missing is calculating command lines that will be used elsewhere. For my example, I'm creating a package.json programmatically and want to safely quote what's going into the bin field for a command line

github-actions[bot] commented 1 year ago

There has been no activity on this feature request for 5 months and it is unlikely to be implemented. It will be closed 6 months after the last non-automated comment.

For more information on how the project manages feature requests, please consult the feature request management document.

github-actions[bot] commented 1 year ago

There has been no activity on this feature request and it is being closed. If you feel closing this issue is not the right thing to do, please leave a comment.

For more information on how the project manages feature requests, please consult the feature request management document.

mnpenner commented 1 year ago

If childProcess.exec is going to invoke some arbitrary shell, then childProcess ought to provide a function/method for escaping arguments appropriate to that shell. I don't see much discussion of this here, but in process.platform === 'win32' land, shell rules are different than bash. The code offered above definitely does not seem to account for that.

This is still kind of half baked, but I'm attempting to sniff out the shell and escape it properly here: https://github.com/mnpenner/child-spawn/blob/a92d7180c5d5d9620cfdb8d6db8a44624b157a3f/src/index.ts#L115

Currently just supports bash/zsh and dash (which I think is what sh normally is? At least on Ubuntu).

yonathan06 commented 1 year ago

@ThisIsMissEm I came across the same issue, building an app for both Mac and Win I tried many techniques (regex matching, checking for special chars, wrapping all in ' or "), but for all of them it didn't work in all cases I eventually ended up going through each arg (an array of strings) and calling JSON.stringify to each one, and it did the trick :)

mnpenner commented 1 year ago

@ThisIsMissEm I came across the same issue, building an app for both Mac and Win I tried many techniques (regex matching, checking for special chars, wrapping all in ' or "), but for all of them it didn't work in all cases I eventually ended up going through each arg (an array of strings) and calling JSON.stringify to each one, and it did the trick :)

Shells don't parse args as JSON. I'd be very weary of using JSON.stringify.

yonathan06 commented 1 year ago

@ThisIsMissEm I came across the same issue, building an app for both Mac and Win I tried many techniques (regex matching, checking for special chars, wrapping all in ' or "), but for all of them it didn't work in all cases I eventually ended up going through each arg (an array of strings) and calling JSON.stringify to each one, and it did the trick :)

Shells don't parse args as JSON. I'd be very weary of using JSON.stringify.

I was referring to parsing each arg, not the all array

const args = ['dosomething', '/path/to/my/file', '--flag', '10'];
const parsedArgs = args.map(a => JSON.stringify(a));
exec(parsedArgs.join(' '));

You are right the it is not 100% recommended, and I can't guarantee anything here, I only tested with use cases my users have, and it did the job, but perhaps you are right and it won't work well in other cases that I didn't catch yet

yonathan06 commented 1 year ago

@ThisIsMissEm I came across the same issue, building an app for both Mac and Win I tried many techniques (regex matching, checking for special chars, wrapping all in ' or "), but for all of them it didn't work in all cases I eventually ended up going through each arg (an array of strings) and calling JSON.stringify to each one, and it did the trick :)

Shells don't parse args as JSON. I'd be very weary of using JSON.stringify.

@mnpenner you are right, it was reckless, to everyone who sees it ignore this example

nodejs / node

Built-in method to escape shell arguments #34840