A wiki page to summarise the proposals around detecting ES6 Modules in Node
NOTE PROPOSAL WAS MERGED AS DRAFT:
- merged: /~https://github.com/nodejs/node-eps/blob/master/002-es-modules.md
- counter-proposal: /~https://github.com/dherman/defense-of-dot-js
Current Discussion: nodejs/node-eps#13
Previous Discussion: nodejs/node-eps#3
Glossary:
- CJS Module System refers to CJS's
require
andmodule.exports
/exports
- ES Module System refers to ES6's
import
andexport
- CJS Modules are considered files that use CJS's Module System exclusively
- ES Modules are considered files that use ES6's Module System exclusively
Requirements:
- Ability for CJS Module System to load both CJS Modules and ES Modules
- Ability for ES Module System to load both CJS Modules and ES Modules
- Ability to require another package without knowing which module system it uses
- Ability to publish and require packages that have files of both ES Modules and CJS Modules
Spec compliance, some new syntax is being made exclusively for module parser goal, change to remove deprecated / problematic behaviors, and ease of tooling.
Will the ES Module System be able to import specific files, e.g. import thing from 'package/lib/thing.js'
How can node identify whether a file is a CJS module or an ES6 module?
why does it even need to do this?
Because they have different parses in the EcmaScript specification, and the same ambiguous source code can have different global side effects depending upon that.
If it is constrained that import/require is interoperable between CJS and ES6, what does it take for export/module.exports to also be interoperable?
Attempted questions on this:
Attempted answers:
- nodejs/node-eps#3 (comment)
- nodejs/node-eps#3 (comment)
- nodejs/node-eps#3 (comment)
- nodejs/node-eps#3 (comment)
This section serves to address the available methods for addressing the question "How can node identify whether a file is a CJS module or an ES6 module?"
References:
Note
- These forces distinction on source files programmers haven't had to before.
- Different start non-terminals or contextual expectations haven't led to needing detection in the past. However, the Module goal causes ambiguity in source code like
function v() {}
which we need to resolve prior to execution.
Details: This option will require users to add "use module";
or something similar at the top of every file.
Pros:
- Explicit
Cons:
- Unacceptable boilerplate tax on users (this seems to be a very strong reason) (cause of rejection)
- Implementers will have a hard time implementing a double parser considering that modules source text and javascript source text are parsed differently.
- Would require toolchains to have a full JS parser due to how directives work in the EcmaScript spec (ASI / comments / multiple directives). (cause of rejection)
Details: This option will require users to use .jsm or something similar as the file extension for any ES6 module. Likely to be used exclusively (will break older node versions) or to be used coupled with a .js file (e.g. project/hello.js
and project/hello.jsm
both exist, doing require('project/hello')
will prefer .jsm file if it exists otherwise will use .js file).
Notes:
-
Best-case success story is that .jsm becomes the new extension for JS Module goal. (removed hearsay / browsers do not check file extension / all current VMs only target Script goal)
-
Node shouldn't presume to solve a problem for other tools
- front-end toolchains and devs have discussed this as desirable, this is not purely node's opinion
Pros:
- Explicit
- Self contained files still
- Easy for toolchains to recognize (in particular ones like nvm that can't even parse JSON).
Cons:
-
BIG: migration cost imposed on tools and developers; hard to estimate how many things in the world depend on JavaScript === *.js; how many .htaccess files, config files, scripts, etc. break? Possible obstacle to adoption?
- note that file extensions continue to arise, just like how tools are able to easily use
.jsx
- note that file extensions continue to arise, just like how tools are able to easily use
-
It's solved in the browser by out-of-band configuration ((browsers do not have interop and loader hooks are not specified)<script type="module">
and loader hooks to control the detection in users-land) -
It is a refactor hazard for popular modules containing more than one file, it will break existing dependents that are using .js extension in their require calls (e.g.:Most likely irrelevant as same hazard applies to all of these if mode is ambiguous (such asrequire('foo/bar.js')
)./bar.js
switching based upon out of band meta-data), see discussion about banning inner imports -
Combinatorial explosion of JS extensions like .jsx and .ts (e.g.: how to signal that it is a module with typescript annotations?)As stated in original PR comments this is not a language extension. Also mitigated by the other con regarding script like .htaccess files. Seen as a slippery slope argument. -
Many have already solved this problem without requiring a new extensionNot a con, a statement
References:
Details: Engines providing a way to detect the type by inspection of the code.
Pros:
- Least visible impact on ecosystem, does not affect
package.json
or scripts using*.js
Cons:
- Hard to determine file mode for large files if they barely use a feature that causes a mode switch. (cause of rejection)
- Implementers unlikely to implement a parsing API that auto-detects the mode (this is a very strong reason)
- Implementing this is non-trivial
- Likely hurt the performance of importing
- Ambiguity causes potential implicit semantic changes (cause of rejection)
References
Universal Pros:
- Does not affect scripts using
*.js
Universal Cons:
- Node allows requiring files that are not part of packages, e.g.
require('C:/a.js')
whereC:/package.json
doesn't exist - Will persist in perpetuity and have hints of previous Script goal
- mitigated if
npm init
defaults to move all of module to Module goal somehow
- mitigated if
- Causes tooling to need to perform JSON parsing per module resolution
- mitigated by cache
- many cli tools don't have JSON parsing
- File semantics are no longer self contained to the file
- File mode is unknown without reading a file located, somewhere (developer must search).
Details: package.json
has a "main"
for CJS Module entry and "module"
for ES6 Module entry.
{
// ...
"module": "lib/index.js",
"main": "old/index.js",
// ...
}
Note: "module"
seems like a better option than our previous proposal "jsnext:main"
, which is being used today in many modules.
Pros:
- [package.json pros
- Harmonizes with
<script type="module">
- Allows simultaneous new and old entry point
- Replaces
"main"
(which eventually becomes "the old way") with an ergonomic standard way
Cons:
- package.json cons
- Requires external method for specifying Module System of non-entry points, e.g.
require('lib/something-else.js')
- which kind of defeats the purpose of this option. See other options under 4 that specify this. (cause of rejection)
Details: package.json
has a modules
property that is an array of files or directories that use ES Modules.
{
// ...
// files:
"modules": ["lib/hello.js", "bin/hello.js"],
// directories:
"modules": ["lib", "bin"],
// files and directories:
"modules": ["lib", "bin", "special.js"],
// if package never uses CJS Modules
"modules": ["."],
}
Pros:
- [package.json pros
- Solves requiring an ES Module that was not the entry point
Cons:
Notes:
- Requires external solution for determining the entry point
Updates:
- Updated this proposal from the original files only regex approach, to an array of files and directories as it is cleaner, doesn't require regex knowledge, and can actually be serialised to JSON! ~ @balupton
References:
Details: package.json
has a "editions"
property that lists which editions the package contains, and what their syntaxes are.
{
"editions": [{
// source that contains special features that get compiled away
"syntax": ["ESNext", "ES Modules", "Flow Type", "JSX"],
"entry": "source/index.js"
}, {
// compiled for future browser and node support, only has confirmed upcoming ES features
"syntax": ["ESNext", "ES Modules"],
"entry": "esnext-esm/index.js"
}, {
// compiled for current browsers and node 0.12, 4, and 5 support
"syntax": ["ES2015", "ES Modules"],
"entry": "es2015-esm/index.js"
}, {
// compiled for current browsers and node 0.12, 4, and 5 support
// uses babel-plugin-add-module-exports package to convert ES Modules to CJS Modules
"syntax": ["ES2015", "CJS Modules"],
"entry": "es2015-cjs/index.js"
}, {
// compiled with babel preset 2015-loose for node 0.10 and IE8 support without need for external es2015 polyfills
// uses babel-plugin-add-module-exports package to convert ES Modules to CJS Modules
"syntax": ["ES5", "CJS Modules"],
"entry": "es5-cjs/index.js"
}]
}
Pros:
-
[package.json pros
-
Entry point is selected by which edition the current environment supports, rather than just which edition uses ES Modules or CJS Modules (which may still make use of a feature that is not currently supported by the environment)
-
For systems that only support
"main"
entry point, the package author can specify"main"
to go to either to:- The ES2015 & CJS Module edition, e.g.
"main": "es2015-cjs/index.js"
- To a CJS Module script that automatically detects the correct edition to load
- The ES2015 & CJS Module edition, e.g.
-
Allows future node-core, parsers, bundlers, and compilers to intelligently decide which edition they can support either by built-in support into themselves, or via a
"main"
script that does the same detection -
Allow meta generators like projectz to automatically generate README documentation on which editions the package provides
-
Solves the problem of bundlers like rollup, browserify, webpack re-compiling already compiled ES5 or ES2015 editions rather than the latest edition that is supported by that system and compiling down to the desired target
Cons:
- package.json cons
- EXTREMELY HIGH COMPLEXITY if you don't have a build tool
- path to exponential explosion due to ease of extending
Notes:
- Requires external method for specifying Module System of non-entry points, e.g.
require('lib/something-else.js')
, however this is easily solved by any of these:-
Adding a
"directory"
property to each direction, which would work well, however no way to use ES Modules outside of edition directories -
Coupling with any other proposal.
-
E.g. Coupling with the Modules Listing Proposal would work well and wouldn't require additional tooling or compiler updates, e.g.
{ "modules": ["source", "esnext-esm", "es2015-esm", "non-edition-file.js"], "editions": [ /* ... */ ] }
-
E.g. Coupling with the JSM Extension Proposal would allow for the
-esm
and-cjs
prefixes to go away, as each edition would support both CJS Modules and ES Modules:{ "editions": [{ // source that contains special features that get compiled away "syntax": ["ESNext", "Flow Type", "JSX"], "entry": "source/index.js" }, { // compiled for future browser and node support, only has confirmed upcoming ES features // uses some babel plugin package to also output .jsm files along with the .js files "syntax": ["ESNext"], "entry": "esnext/index" }, { // compiled for current browsers and node 0.12, 4, and 5 support // uses some babel plugin package to also output .jsm files along with the .js files "syntax": ["ES2015"], "entry": "es2015/index" }, { // compiled with babel preset 2015-loose for node 0.10 and IE8 support without need for external es2015 polyfills // uses some babel plugin package to also output .jsm files along with the .js files "syntax": ["ES5"], "entry": "es5-cjs/index" }] }
-
References:
Implementations: