Actually, my original plan was to represent all collision masks as bitstreams, base64-encode them, and store the resulting string in the JSON alongside "width" and "height" values. It might seem ugly, but it's also easy, straightforward, and unsurprising. All good things.
Since the whole map will be in a ZIP file, the DEFLATE compression will eliminate a lot of the redundant bytes in JSON.
And, finally, if you really don't want the wallmask data inside the JSON, we can store its raw bitstream in a file in the zip and reference it from the JSON config.
Agreed with Orpheon: not a big deal either way.