Proposal for Adding .xml() and .html() Methods to Async Parsing in ECMAScript

brandonmcconnell · December 20, 2023, 9:18am

Introduction

This proposal aims to introduce two new methods, .xml() and .html(), to the existing asynchronous parsing interface in ECMAScript. Currently, developers have access to .text() and .json() for handling text and JSON responses, respectively. The addition of .xml() and .html() would streamline the processing of XML and HTML content in a manner consistent with the current standards.

Rationale

Consistency and Simplicity: The .text() and .json() methods are widely used due to their simplicity and consistency. Adding .xml() and .html() would extend this ease of use to XML and HTML data, which are common in web development.
Performance Benefits: By integrating these methods into the language standard, parsers can be optimized at the engine level, potentially leading to performance improvements over custom parsing solutions.
Improved Readability and Maintenance: Native methods simplify code, making it more readable and easier to maintain. This aligns with modern JavaScript's goal of providing a more concise and powerful syntax.

Technical Specification

`.xml()` Method

Purpose: Parses the response body as XML.
Usage: response.xml().
Return Type: A promise that resolves with the result of parsing the response body text as XML.

`.html()` Method

Purpose: Parses the response body as HTML.
Usage: response.html().
Return Type: A promise that resolves with the result of parsing the response body text as HTML.

Implementation Notes

The implementation should be in line with the existing structure of .text() and .json().
Error handling for malformed content should be consistent with the handling in .json().
Consideration for security implications, especially with .html(), is crucial to prevent injection attacks.

Use Cases

XML Feeds: Simplifying the consumption of XML feeds like RSS or Atom.
Client-Side Templating: Easier integration of HTML templates fetched from a server.
Web Scraping: Streamlined parsing of HTML for data extraction in web scraping applications.

Potential Challenges

Security Concerns: Particularly with .html(), ensuring safe parsing to prevent cross-site scripting (XSS) attacks.
Browser Support and Polyfills: Ensuring consistent behavior across different JavaScript engines and providing polyfills for backward compatibility.

Conclusion

The addition of .xml() and .html() methods to asynchronous parsing in ECMAScript will significantly benefit developers by providing a consistent and optimized way to handle XML and HTML data. This proposal seeks the TC39's consideration for these additions, which align with the ongoing evolution of JavaScript towards a more powerful and developer-friendly language.

ljharb · December 20, 2023, 4:21pm

On what object are these methods? There's no i/o in Ecmascript, so this might be something more relevant to WHATWG.

tabatkins · December 20, 2023, 6:44pm

Yeah, these would be on the Response object from fetch(), so it should be proposed in the Fetch repo

bergus · December 21, 2023, 6:48am

…where it already has been shot down repeatedly: Proposal: response.xml() · Issue #16 · whatwg/fetch · GitHub

dmchurch · December 24, 2023, 9:04pm

Given that DOM nodes (what I assume would be the return type of a .*ml call) all belong to an owner document, I'd think a method on Document that accepts a Response as parameter, or perhaps a constructor or static method on that or DocumentFragment, would be a more natural place for such a feature. Do you happen to know if it's been proposed as such for the DOM? I can't recall who owns that off the top of my head

claudiameadows · December 25, 2023, 9:24am

That would be exacty what @bergus said: [quote="bergus, post:4, topic:1869, full:true"]
…where it already has been shot down repeatedly: Proposal: response.xml() · Issue #16 · whatwg/fetch · GitHub
[/quote]

dmchurch · December 25, 2023, 2:46pm

yes, it has been shot down as an addition to Fetch. I personally agree that it doesn't have a place on Fetch, so the question becomes "where does this feature belong". That's why I'm curious if this has been proposed as an addition to Document or DocumentFragment, which are not part of Fetch but rather, part of the DOM.

claudiameadows · December 25, 2023, 4:00pm

In any case, it's not really a JS feature, so you'd have better luck asking in various WHATWG repos (in your case, GitHub - whatwg/html: HTML Standard is likely your best bet).

dmchurch · December 25, 2023, 4:34pm

I mean I personally don't have enough interest or motivation to pursue this myself, but the responses @tabatkins and @bergus suggest a message of "this feature has no place in spec", and that's not true. I wanted to provide @brandonmcconnell with an idea for where the feature might be more appropriate, and given that @bergus had some useful info about how this concept has fared when proposed to the Fetch group, I thought that perhaps he, or someone, might have seen similar proposals in a different group or have some other useful information about how likely such a proposal would be to succeed.

SpcFORK · June 25, 2024, 2:40pm

I like this, specifically XML()
It would be nice with modern implementations of these too:
XML
XMLList
Namespace
flash.xml - Adobe ActionScript® 3 (AS3) API Reference

I don't like the promise part. since something like new DOMParser().parseFromString(str, "text/xml"); (in JS) doesn't, and is sync along with no rejection.

Topic		Replies	Views
Proposal: Parser Augementation with CSTML 💡 Ideas	0	283	September 2, 2024
Schema-based JSON parsing? 💡 Ideas	9	613	April 4, 2022
JSON.traverse() 🦋 Proposals proposal	8	419	June 15, 2024
Text Modules 💡 Ideas	6	503	November 8, 2023
More language deprecations and fewer new features 🦋 Proposals	0	625	January 8, 2020