JSON.isWellFormed(str: string): boolean

cesco69 · September 29, 2025, 8:18am

Introduction

This proposal introduces a new method to the JSON namespace:

JSON.isWellFormed(str: string): boolean

The method returns true if and only if the input string does not contain characters that would require escaping when serialized via JSON.stringify.

Motivation

In many applications, developers want to know in advance whether a string will produce a "clean" JSON representation, without escape sequences.

Examples:

Optimizing serialization performance.
Ensuring human-readable JSON outputs.
Validating user input before sending data over the network or storing in a database.

A dedicated JSON.isWellFormed method would provide a fast, standardized way to check this.

Definition

A string is considered well-formed for JSON if it does not contain:

ASCII control characters: U+0000 → U+001F
Quotation mark (", U+0022)
Reverse solidus (backslash, \, U+005C)
Unpaired UTF-16 surrogates: U+D800 → U+DFFF

Examples

JSON.isWellFormed("hello world");
// → true

JSON.isWellFormed("hello \"world\"");
// → false (contains quotation mark)

JSON.isWellFormed("line \n break");
// → false (contains control char)

JSON.isWellFormed("emoji 😀");
// → true

JSON.isWellFormed("\uD800"); 
// → false (lone surrogate)

Possible Specification (Informal)

function isWellFormed(str) {
  for (let i = 0; i < str.length; i++) {
    const code = str.codePointAt(i);

    // ASCII control characters
    if (code <= 0x1F) return false;

    // " or \
    if (code === 0x22 || code === 0x5C) return false;

    // Unpaired surrogate
    if (code >= 0xD800 && code <= 0xDFFF) return false;

    if (code > 0xFFFF) i++; // skip surrogate pair
  }
  return true;
}

Prior Art

String.prototype.isWellFormed and String.prototype.toWellFormed (added in ES2024) ensure valid Unicode strings but are unrelated to JSON escaping.
This proposal specifically targets JSON serialization safety.

aclaymore · September 29, 2025, 8:33am

would you be able to share links to code that is checking this?

In my experience most code is using JSON as a serialisation format, so they only care that you’ll get the same value back when parsing the JSON. Not if it needs to include escape characters to do so.

cesco69 · September 29, 2025, 8:38am

see fast-json-stringify/lib/serializer.js at cc14cc8abe958274cacca05e8ab8e3a8fd0410b8 · fastify/fast-json-stringify · GitHub

"fastify" check those chars for each response because it use a built in json-stringify (GitHub - fastify/fast-json-stringify: 2x faster than JSON.stringify())

aclaymore · September 29, 2025, 8:53am

Thanks! So it’s common for custom JSON encoder libraries, rather than the top level application itself?

cesco69 · September 29, 2025, 9:01am

It’s not very common in everyday usage, but it can be extremely useful in critical high-traffic scenarios. JSON.stringify is relatively slower, and custom stringifiers can provide a real performance boost.

see:

custom stringifiers need something like JSON.isWellFormed()

Actually it uses regex

github.com/elysiajs/json-accelerator

src/index.ts

7fcbf62c8


      
          	 * - 'ignore': Ignore the unsafe character, this implied that end user should handle it
          	 * - 'sanitize': Sanitize the string and continue encoding
          	 *
          	 * @default 'sanitize'
          	 **/
          	sanitize: 'auto' | 'manual' | 'throw'
          	definitions: Record<string, TAnySchema>
          }
          
          // equivalent to /["\n\r\t\b\f\v]/
          const findEscapeSequence = /["\b\t\n\v\f\r\/]/
          
          const SANITIZE = {
          	auto: (property: string) =>
          		`${findEscapeSequence}.test(${property})?JSON.stringify(${property}).slice(1,-1):${property}`,
          	manual: (property: string) => `${property}`,
          	throw: (property: string) =>
          		`${findEscapeSequence}.test(${property})?(()=>{throw new Error("Property '${property}' contains invalid characters")})():${property}`
          } satisfies Record<Instruction['sanitize'], (v: string) => string>
          
          const joinStringArray = (p: string) =>

github.com/fastify/fast-json-stringify

lib/serializer.js

cc14cc8ab


      
          'use strict'
          
          // eslint-disable-next-line
          const STR_ESCAPE = /[\u0000-\u001f\u0022\u005c\ud800-\udfff]/
          
          module.exports = class Serializer {
            constructor (options) {
              switch (options && options.rounding) {
                case 'floor':
                  this.parseInteger = Math.floor
                  break
                case 'ceil':
                  this.parseInteger = Math.ceil
                  break

but regex are slow!

swhiteman · September 29, 2025, 6:49pm

I think you’re reversing cause and effect here. The restrictive JSON stringifiers are faster because they model output on a schema (either official JSON Schema(tm) or something proprietary) and do not support the entire JSON grammar.

"Well-formed" is thus a misnomer and even the authors of those packages wouldn’t call their happy path "well-formed," it’s just simple enough for them to use all-userland code.

Like you wouldn’t call a CSV "well-formed" if it doesn’t have escaped delimiters and dquotes, would you? You might call it "simple" or "naive" though.

michael · September 30, 2025, 3:29pm

What makes you think that your proposed method would be any faster? And why do you think regexps are slow?

cesco69 · October 1, 2025, 7:01am

About performance, similar to String.prototype.isWellFormed() (docs):

isWellFormed() is more efficient, as engines can directly access the internal representation of strings.

In the same way, JSON.isWellFormed could access the internal representation of strings and thus be more efficient than using regular expressions.

eg. in V8 src/json/json-stringifier.cc - v8/v8 - Git at Google the efficiency comes from vectorized word-level scanning, with a fallback to per-character checks when needed. This is what should be exposed as JSON.isWellFormed

Topic		Replies	Views
JSON.isParseableAsJSON() 💡 Ideas proposal	9	197	August 17, 2025
Make `JSON.stringify` more fast with JSON Schema 💡 Ideas	1	195	March 11, 2024
JSON.equals(x, y) 💡 Ideas	8	216	April 19, 2024
JSON.safeParse()：call JSON.parse() in try/catch? 💡 Ideas	9	3439	November 7, 2023
Make `JSON.stringify` more convenient? 💡 Ideas	6	469	April 21, 2021