"This is the fourth day of my participation in the update challenge in November. For details: 2021 last challenge」.
Write in front
Webpack can be called a mainstay in the front-end construction tools, including daily business development, front-end infrastructure tools, advanced front-end interview It will appear in any scene.
You may be confused about its internal implementation mechanism. In daily work, you still don't understand the meaning and application mode of various parameters based on the API such as Webpack Plugin/Loader.
In fact, all these reasons are essentially based on Webpack workflow. There is no clear understanding, resulting in the so-called "no way to start with API" development.
In this article, from the perspective of how to realize the packaging of module analysis projects, we will use the most popular, concise and clear code to take you to uncover the mystery behind webpack and take you to implement a simple version of webpack. From then on, we will have a clear understanding of any underlying development related to webpack.
Here we only talk about "dry goods", and take you into the workflow of webpack with the most easy to understand code.
I hope you can master the pre knowledge
Tapable The package is essentially a library for us to create custom events and trigger custom events, which is similar to the EventEmitter Api in Nodejs.
The plug-in mechanism in Webpack is based on the decoupling between Tapable implementation and packaging process. All forms of plug-ins are based on Tapable implementation.
For learning purposes, we will focus on the Webpack Node Api process to explain. In fact, the npm run build command we use in the front end is also to call the bin script through the environment variable to call the Node Api to perform compilation and packaging.
The AST analysis inside Webpack also depends on Babel for processing, if you are not very familiar with Babel. I suggest you read these two articles first "Front end infrastructure" takes you to travel in Babel's world,#Enter the world of Babel plug-in developers from Tree Shaking.
Of course, I will explain the application of these contents in Webpack in detail later, but I hope you can click the document at the top to learn a little about the pre knowledge before reading the article.
Process combing
Before we start, let's sort out the whole packaging process.
Here is just a comb of the whole process. Now you don't need to think in detail about what happens in each step. We will connect them step by step in the next steps.
As a whole, we will analyze the Webpack packaging process from the above five aspects:
- Initialization parameter phase. This step will start from our configured webpack config. The corresponding configuration parameters read in JS and the parameters passed in from the shell command are combined to obtain the final packaged configuration parameters.
- Start compilation preparation phase In this step, we will return a compiler method by calling the webpack() method, create our compiler object, and register each Webpack Plugin. Find the entry code in the configuration entry and call compiler Compile with the run () method.
- Module compilation phase Analyze from the entry module and call the loaders of the matching file to process the file. At the same time, analyze the modules that the module depends on, and compile the module recursively.
- Complete compilation phase After the recursion is completed, each reference module is processed by loaders, and the interdependence between modules is obtained.
- Output file phase After processing the dependent files in the output module to the disk.
Next, let's explore in detail what happened at each step.
Create directory
Sharp tools make good work. First, let's create a good directory to manage the packaging tool we need to implement!
Let's create such a directory:
- webpack/core stores the core code of webpack that we will implement.
- webpack/example holds the instance projects we will use to package.
- webpack/example/webpak.config.js configuration file
- webpack/example/src/entry1 first entry file
- webpack/example/src/entry1 second entry file
- webpack/example/src/index.js module file
- webpack/loaders store our custom loaders.
- webpack/plugins store our custom plugins.
Initialization parameter phase
Often, there are two ways to pass packaging parameters to webpack in the daily use stage. Let's see how to pass parameters first:
Cli command line pass parameters
Usually, when we call the webpack command, we sometimes pass in certain command line parameters, such as:
webpack --mode=production # Call the webpack command to execute packaging and pass in the mode of production
webpack.config.js pass parameters
Another way, I believe, is more commonplace.
We use webpack. Com under the project root directory config. JS export an object for webpack configuration:
const path = require('path') // Introducing loader and plugin module.exports = { mode: 'development', entry: { main: path.resolve(__dirname, './src/entry1.js'), second: path.resolve(__dirname, './src/entry2.js'), }, devtool: false, // The basic directory, the absolute path, is used to resolve the entry point and loader from the configuration. // In other words, all relative paths of entry and loader are relative to this path context: process.cwd(), output: { path: path.resolve(__dirname, './build'), filename: '[name].js', }, plugins: [new PluginA(), new PluginB()], resolve: { extensions: ['.js', '.ts'], }, module: { rules: [ { test: /\.js/, use: [ // There are three ways to use your own loader. Here is just one path.resolve(__dirname, '../loaders/loader-1.js'), path.resolve(__dirname, '../loaders/loader-2.js'), ], }, ], }, };
At the same time, this configuration file is also the instance configuration under example, which we need as the instance project. Next, let's modify example / webpack config. JS is the above configuration.
Of course, you don't need to understand the loader and plugin here at present. Next, we will gradually implement these things and add them to our packaging process.
Implement the phase of merging parameters
In this step, let's really start to realize our webpack!
First, let's create a new index under webpack/core JS file as the core entry file.
At the same time, create a webpack/core and create a new webpack JS file as the implementation file of webpack() method.
First of all, we know that the compiler object is obtained through the webpack() method in NodeJs Api.
At this point, let's supplement the index. COM interface format according to the original webpack interface format Logic in JS:
- We need a webpack method to execute the call command.
- At the same time, we introduce webpack config. JS configuration file is passed into the webpack method.
// index.js const webpack = require('./webpack'); const config = require('../example/webpack.config'); // Step 1: initialize parameters and synthesize parameters according to configuration file and shell parameters const compiler = webpack(config);
Well, it looks good. Next, let's implement webpack js:
function webpack(options) { // Merge parameters get the merged parameter mergeOptions const mergeOptions = _mergeOptions(options); } // Merge parameters function _mergeOptions(options) { const shellOptions = process.argv.slice(2).reduce((option, argv) => { // argv -> --mode=production const [key, value] = argv.split('='); if (key && value) { const parseKey = key.slice(2); option[parseKey] = value; } return option; }, {}); return { ...options, ...shellOptions }; } module.exports = webpack;
What we need to add here is
In the webpack file, you need to export a method named webpack and accept the configuration object passed in from the outside. This is what we talked about above.
Of course, the logic of merging parameters is to finally merge the external incoming object and the incoming parameters when executing the shell.
In Node Js, we can use process argv. Slice (2) to obtain the parameters passed in the shell command, such as:
Of course_ The mergeOptions method is a simple method to merge configuration parameters. I believe it is a piece of cake for everyone.
Congratulations 🎉, A thousand mile trip begins with one step. In this step, we have completed the first step in the packaging process: merging configuration parameters.
Compilation phase
After getting the final configuration parameters, we need to do the following things in the webpack() function:
- Create a compiler object with parameters. We can see that in the official case, a compiler object is returned by calling the webpack(options) method. And call compiler The code started by the run () method is packaged.
- Register the webpack plugin we defined.
- Find the corresponding packaging entry file according to the incoming configuration object.
Create compiler object
Let's finish the index first Complete the logic code in JS:
// index.js const webpack = require('./webpack'); const config = require('../example/webpack.config'); // Step 1: initialize parameters and synthesize parameters according to configuration file and shell parameters // Step 2: call Webpack(options) to initialize the compiler object // The webpack() method returns a compiler object const compiler = webpack(config); // Call the run method for packaging compiler.run((err, stats) => { if (err) { console.log(err, 'err'); } // ... });
As you can see, the core compilation implementation lies in the compiler returned by the webpack() method On the run () method.
Let's improve this webpack() method step by step:
// webpack.js function webpack(options) { // Merge parameters get the merged parameter mergeOptions const mergeOptions = _mergeOptions(options); // Create compiler object const compiler = new Compiler(mergeOptions) return compiler } // ...
Let's also create a new compiler in the webpack/core directory JS file, as the core implementation file of compiler:
// compiler.js // Compiler class for core compilation implementation class Compiler { constructor(options) { this.options = options; } // The run method starts compilation // At the same time, the run method accepts the callback passed externally run(callback) { } } module.exports = Compiler
At this point, our Compiler class will first build a basic skeleton code.
At present, we have:
- webpack/core/index.js is the entry file for packaging commands. This file refers to our own webpack and external webpack config. js(options). Call webpack (options) Run() starts compiling.
- webpack/core/webpack.js this file handles the merging of parameters and passes in the merged parameter new Compiler(mergeOptions), and returns the created Compiler object at the same time.
- webpack/core/compiler. At this time, our compiler is only used as a basic skeleton, and there is a run() startup method.
Write Plugin
Remember when we were on webpack config. Are two plugins - plugina and pluginB plugins used in JS. Next, let's implement them in turn:
Before implementing Plugin, we need to improve the compiler method:
const { SyncHook } = require('tapable'); class Compiler { constructor(options) { this.options = options; // Create plugin hooks this.hooks = { // Hook at start of compilation run: new SyncHook(), // Execute before outputting the asset to the output directory (before writing the file) emit: new SyncHook(), // Execute all when compilation is completed done: new SyncHook(), }; } // The run method starts compilation // At the same time, the run method accepts the callback passed externally run(callback) {} } module.exports = Compiler;
Here, we create an attribute hooks in the constructor of the Compiler class. Its value is the three attributes run, emit and done.
The values of these three attributes are the SyncHook method of tabable, which we mentioned above. In essence, you can simply understand the SyncHook() method as an Emitter Event class.
When we return an object instance through new SyncHook(), we can use this hook. run. The tap ('name ', callback) method adds event listening to this object, and then through this hook. run. Call() executes all tap registered events.
Of course, there are many hooks in the real source code of webpack. And there are synchronous / asynchronous hooks respectively. Here we are more to explain the process clearly for you, so we only list three common and simple synchronous hooks.
At this point, we need to understand that we can use the Compiler on the instance object returned by the Compiler class hooks. run. Tap register hook.
Next, let's switch back to webpack JS, let's fill in the logic about plug-in registration:
const Compiler = require('./compiler'); function webpack(options) { // Merge parameters const mergeOptions = _mergeOptions(options); // Create compiler object const compiler = new Compiler(mergeOptions); // Loading plug-ins _loadPlugin(options.plugins, compiler); return compiler; } // Merge parameters function _mergeOptions(options) { const shellOptions = process.argv.slice(2).reduce((option, argv) => { // argv -> --mode=production const [key, value] = argv.split('='); if (key && value) { const parseKey = key.slice(2); option[parseKey] = value; } return option; }, {}); return { ...options, ...shellOptions }; } // Loading plug-in functions function _loadPlugin(plugins, compiler) { if (plugins && Array.isArray(plugins)) { plugins.forEach((plugin) => { plugin.apply(compiler); }); } } module.exports = webpack;
Here we call compiler after creating the completed object. The loadPlugin method registers the plug-in.
Students who have been in contact with webpack plug-in development may have known it more or less. Any webpack plug-in is a class (of course, the class is essentially the syntax sugar of funciton), and each plug-in must have an apply method.
This apply method will accept a compiler object. What we did above is to call the apply method of the incoming plugin in turn and pass in our compiler object.
Here, I ask you to remember the above process. When we write the webpack plugin, it is essentially to operate the compiler object, which affects the packaging results.
Maybe you don't quite understand the meaning of this sentence at this time. After we complete the whole process in series, I will reveal the answer for you.
Next, let's write these plug-ins:
Students who do not understand plug-in development can go and have a look Official introduction In fact, it's not very difficult. Personally, I strongly suggest that if you don't understand it, you can go and see it first and then come back. You will gain something by combining the content of the above change.
First, let's create a file:
// plugin-a.js // Plug in A class PluginA { apply(compiler) { // Register synchronization hook // The compiler object here is the instance created by new Compiler() compiler.hooks.run.tap('Plugin A', () => { // call console.log('PluginA'); }); } } module.exports = PluginA;
// plugin-b.js class PluginB { apply(compiler) { compiler.hooks.done.tap('Plugin B', () => { console.log('PluginB'); }); } } module.exports = PluginB;
Seeing this, I believe most students have reacted, compiler hooks. done. Isn't tap what we mentioned above, creating a SyncHook instance through tapable and registering events through the tap method?
you 're right! Indeed, the webpack plug-in essentially listens to events on the compiler through the publish subscribe mode. Then, the listening events are triggered during the packaging and compilation process, so as to add some logic to affect the packaging results.
On the apply method of each plug-in, we subscribe to the corresponding events through tap in the compilation preparation stage (that is, when calling the webpack() function). When our compilation is executed to a certain stage, we publish the corresponding events and tell the subscribers to execute the monitored events, so as to trigger the corresponding plugin in different life cycles of the compilation stage.
So you should know here that when we develop webpack plug-ins, the compiler object stores all the relevant properties of this package, such as the configuration of options package and various properties we will talk about later.
Find entry
After that, most of our content will be put in Compiler JS to implement the Compiler class to realize the core process of packaging.
We need to enter the compilation stage from the packaging stage. The first thing is that we need to find the corresponding entry file according to the path of the entry configuration file.
// compiler.js const { SyncHook } = require('tapable'); const { toUnixPath } = require('./utils'); class Compiler { constructor(options) { this.options = options; // Relative path and path Context parameters this.rootPath = this.options.context || toUnixPath(process.cwd()); // Create plugin hooks this.hooks = { // Hook at start of compilation run: new SyncHook(), // Execute before outputting the asset to the output directory (before writing the file) emit: new SyncHook(), // Execute all when compilation is completed done: new SyncHook(), }; } // The run method starts compilation // At the same time, the run method accepts the callback passed externally run(callback) { // When the run mode is called, the plugin that starts compiling is triggered this.hooks.run.call(); // Get portal configuration object const entry = this.getEntry(); } // Get entry file path getEntry() { let entry = Object.create(null); const { entry: optionsEntry } = this.options; if (typeof optionsEntry === 'string') { entry['main'] = optionsEntry; } else { entry = optionsEntry; } // Make entry an absolute path Object.keys(entry).forEach((key) => { const value = entry[key]; if (!path.isAbsolute(value)) { // When converting to absolute path, the unified path separator is/ entry[key] = toUnixPath(path.join(this.rootPath, value)); } }); return entry; } } module.exports = Compiler;
// utils/index.js /** * * The unified path separator is mainly used to facilitate the subsequent generation of module ID * @param {*} path * @returns */ function toUnixPath(path) { return path.replace(/\\/g, '/'); }
In this step, we pass options The entry process obtains the absolute path of the entry file.
Here are a few points to note:
- this.hooks.run.call()
In us_ In the loadePlugins function, each incoming plug-in is subscribed in the compiler instance object, so when we call the run method, it is equivalent to actually starting the compilation. This stage is equivalent to telling the subscriber that the publication starts executing the subscription. At this point, we pass this hooks. run. Call () executes all tap listening methods about run, thus triggering the corresponding plugin logic.
- this.rootPath:
In the above external webpack config. JS, we have configured a context: process CWD (), in fact, the default value of this context in the real webpack is process cwd().
You can see a detailed explanation of it here Context.
In short, this path is the directory path of our project startup. Any relative path in entry and loader is the relative path for the context parameter.
Here we use this Rootpath saves this variable in the constructor.
- toUnixPath tool method:
Because file separation paths are different under different operating systems. Here, we use \ to replace / / in the path to replace the module path. In the future, we will use the path of the module relative to the rootPath as the unique ID of each file, so the path separator is handled uniformly here.
- Processing method of entry:
There are actually many kinds of entry configurations in webpack. Here we consider two common configuration methods:
entry:'entry1.js' // In essence, this code will be converted into entry: { main:'entry1.js }
entry: { 'entry1':'./entry1.js', 'entry2':'/user/wepback/example/src/entry2.js' }
Either of these two methods will be finally transformed into {[module name]: [module absolute path]...} through the getEntry method The geEntry() method is actually very simple, so I won't be too cumbersome about the implementation process of this method.
In this step, we get an object with key as entryname and value as entryAbsolutePath through getEntry method. Let's start the compilation process from the entry file.
Module compilation phase
Above, we talked about the preparations for the compilation phase:
- Directory / file base logic supplement.
- Through hooks Tap registers the webpack plug-in.
- The getEntry method obtains the object of each entry.
Next, let's continue to improve compiler js.
In the module compilation phase, we need to do the following events:
- Analyze the entry file according to the path of the entry file, match the entry file, and process the corresponding loader.
- Compile the entry file processed by loader with webpack.
- Analyze the dependency of the entry file, and repeat the above two steps to compile the corresponding dependency.
- If there are dependent files in the nested file, the dependent module is called recursively for compilation.
- After recursive compilation, assemble chunk s containing multiple modules
First, let's give it to compiler JS constructor, add the corresponding logic:
class Compiler { constructor(options) { this.options = options; // Create plugin hooks this.hooks = { // Hook at start of compilation run: new SyncHook(), // Execute before outputting the asset to the output directory (before writing the file) emit: new SyncHook(), // Execute all when compilation is completed done: new SyncHook(), }; // Save all entry module objects this.entries = new Set(); // Save all dependent module objects this.modules = new Set(); // All code block objects this.chunks = new Set(); // Document object for storing this output this.assets = new Set(); // Save the file name of all the output of this compilation this.files = new Set(); } // ... }
Here, we add some column attributes to the compiler constructor to save the corresponding resource / module objects generated in the compilation stage.
About entries\modules\chunks\assets\files, these Set objects are attributes that run through our core packaging process. They are used to store different resources in the compilation stage, so as to finally generate the compiled file through the corresponding attributes.
Analyze the entry file according to the entry file path
As mentioned above, we can already use this in the run method getEntry(); Get the corresponding entry object ~
Next, let's start from the entry file to analyze the entry file!
class Compiler { // The run method starts compilation // At the same time, the run method accepts the callback passed externally run(callback) { // When the run mode is called, the plugin that starts compiling is triggered this.hooks.run.call(); // Get portal configuration object const entry = this.getEntry(); // Compile entry file this.buildEntryModule(entry); } buildEntryModule(entry) { Object.keys(entry).forEach((entryName) => { const entryPath = entry[entryName]; const entryObj = this.buildModule(entryName, entryPath); this.entries.add(entryObj); }); } // Module compilation method buildModule(moduleName,modulePath) { // ... return {} } }
Here we add a method named buildeentrymodule as the compilation method of the entry module. Loop the entry object to get the name and path of each entry object.
For example, if we pass in entry: {Main: '. / SRC / main. JS'} at the beginning, the formal parameter entry obtained by buildingentrymodule is {main: "/src... [your absolute path]"}. At this time, the entryName accepted by our buildModule method is main, and the entrypath is the absolute path corresponding to the entry file main.
After the compilation of a single entry is completed, we will return an object in the buildModule method. This object is the object after we compile the entry file.
Compiling method of buildModule module
Before coding, let's sort out the buildModule method and what it needs to do:
- buildModule accepts two parameters for module compilation. The first is the name of the entry file to which the module belongs, and the second is the path of the module to be compiled.
- The premise of code compilation of buildModule method is to read the file source code according to the entry file path through fs module.
- After reading the contents of the file, all matching loader is called to process the module and get the result after return.
- After getting the results processed by the loader, analyze the code processed by the loader through babel and compile the code. (this step of compiling is mainly aimed at the require statement and modifying the path of the require statement in the source code).
- If the entry file does not depend on any module (require statement), the compiled module object is returned.
- If there are dependent modules in the entry file, recursive buildModule method is used for module compilation.
Read file contents
- We first call the fs module to read the contents of the file.
const fs = require('fs'); // ... class Compiler { //... // Module compilation method buildModule(moduleName, modulePath) { // 1. Read the original code of the file const originSourceCode = ((this.originSourceCode = fs.readFileSync(modulePath, 'utf-8')); // moduleCode is the modified code this.moduleCode = originSourceCode; } // ... }
Call loader to process the matching suffix file
- Next, after we get the specific contents of the file, we need to match the corresponding loader to compile our source code.
Implement simple custom loader
Before compiling the loader, let's implement the custom loader passed in above.
Create a new loader-1 under the webpack/loader directory js,loader-2. js:
First of all, we need to be clear. Simply speaking, loader is essentially a function that accepts our source code as an input parameter and returns the processed results.
For more details about the features of loader, you can See here , because the article mainly talks about the packaging process, we simply handle loader in reverse order. For more specific loader/plugin development, I will supplement it in detail in subsequent articles.
// loader is essentially a function that accepts the original content and returns the converted content. function loader1(sourceCode) { console.log('join loader1'); return sourceCode + `\n const loader1 = 'https://github.com/19Qingfeng'`; } module.exports = loader1;
function loader2(sourceCode) { console.log('join loader2'); return sourceCode + `\n const loader2 = '19Qingfeng'`; } module.exports = loader2;
Using loader to process files
After making it clear that loader is a simple function, let's give the content to the matching loader for processing before module analysis.
// Module compilation method buildModule(moduleName, modulePath) { // 1. Read the original code of the file const originSourceCode = ((this.originSourceCode = fs.readFileSync(modulePath)), 'utf-8'); // moduleCode is the modified code this.moduleCode = originSourceCode; // 2. Call loader for processing this.handleLoader(modulePath); } // Matching loader processing handleLoader(modulePath) { const matchLoaders = []; // 1. Get all incoming loader rules const rules = this.options.module.rules; rules.forEach((loader) => { const testRule = loader.test; if (testRule.test(modulePath)) { if (loader.loader) { // Only consider loader {test: / \. JS $/ g, use: ['babel loader ']}, {test: / \. JS $/, loader:' Babel loader '} matchLoaders.push(loader.loader); } else { matchLoaders.push(...loader.use); } } // 2. Execute the loader in reverse order to pass in the source code for (let i = matchLoaders.length - 1; i >= 0; i--) { // At present, we only support the loader mode of importing absolute path externally // require import the corresponding loader const loaderFn = require(matchLoaders[i]); // Synchronously process my every compiled moduleCode through loader this.moduleCode = loaderFn(this.moduleCode); } }); }
Here, we use the handleLoader function to match the incoming file path to the loader with the corresponding suffix, and then execute the loader in reverse order to process our code this moduleCode and update each moduleCode synchronously.
Finally, compile this in each module Modulecode will be processed by the corresponding loader.
webpack module compilation phase
In the last step, we experienced that the loader processed our entry file code, and the processed code was saved in this Modulecode.
At this point, after being processed by the loader, we will enter the internal compilation stage of webpack.
What we need to do here is to compile the current module and change the path introduced by all module (require()) statements that the current module depends on into a relative path relative to the following path (this.rootPath).
In short, you need to understand that the result of our compilation here is to change the dependent module path in the source code into a path relative to the path, and establish the basic module dependency at the same time. Later, I will tell you why to compile for path.
Let's continue to improve the buildModule method:
const parser = require('@babel/parser'); const traverse = require('@babel/traverse').default; const generator = require('@babel/generator').default; const t = require('@babel/types'); const tryExtensions = require('./utils/index') // ... class Compiler { // ... // Module compilation method buildModule(moduleName, modulePath) { // 1. Read the original code of the file const originSourceCode = ((this.originSourceCode = fs.readFileSync(modulePath)), 'utf-8'); // moduleCode is the modified code this.moduleCode = originSourceCode; // 2. Call loader for processing this.handleLoader(modulePath); // 3. Call webpack to compile the module and get the final module object const module = this.handleWebpackCompiler(moduleName, modulePath); // 4. Return the corresponding module return module } // Call webpack for module compilation handleWebpackCompiler(moduleName, modulePath) { // Calculate the relative path of the current module relative to the project startup root directory as the module ID const moduleId = './' + path.posix.relative(this.rootPath, modulePath); // Create module object const module = { id: moduleId, dependencies: new Set(), // Absolute path address of the module on which the module depends name: [moduleName], // The entry file to which the module belongs }; // Call babel to analyze our code const ast = parser.parse(this.moduleCode, { sourceType: 'module', }); // Depth first traversal syntax Tree traverse(ast, { // When a require statement is encountered CallExpression:(nodePath) => { const node = nodePath.node; if (node.callee.name === 'require') { // Obtain the relative path of the introduced module in the source code const requirePath = node.arguments[0].value; // Find the absolute path of the module. The current module path + require() corresponds to the relative path const moduleDirName = path.posix.dirname(modulePath); const absolutePath = tryExtensions( path.posix.join(moduleDirName, requirePath), this.options.resolve.extensions, requirePath, moduleDirName ); // Generate moduleId - add the module ID corresponding to the following path into the new dependent module path const moduleId = './' + path.posix.relative(this.rootPath, absolutePath); // Modify the require in the source code through babel to become__ webpack_require__ sentence node.callee = t.identifier('__webpack_require__'); // Modify the modules introduced by the require statement in the source code. All modifications are processed relative to the following path node.arguments = [t.stringLiteral(moduleId)]; // Add the dependency caused by the require statement for the current module (the content is the module ID relative to the root path) module.dependencies.add(moduleId); } }, }); // After traversal, generate new code according to AST const { code } = generator(ast); // Mount the new generated code for the current module module._source = code; // Returns the current module object return module } }
In this step, we have completed the stage of webpack compilation.
It should be noted that:
- Here, we use babel related APIs to compile the require statement. If you don't know much about babel related APIs, you can check my other two articles in the front knowledge. I won't be a burden here
- At the same time, a tryExtensions() tool method is referenced in our code. This method is aimed at the tool method with incomplete suffix name. You can see the specific content of this method later.
- For each file compilation, we will return a module object, which is the top priority.
- id attribute, indicating that the current module is aimed at this Relative directory of rootpath.
- dependencies attribute, which is a Set that internally stores the module ID s of all modules that the module depends on.
- name attribute, which indicates which entry file the module belongs to.
- _ source attribute, which stores the string code of the module itself after babel compilation.
Implementation of tryExtensions method
We are in webpack. Com above config. JS has such a configuration:
Students familiar with webpack configuration may know that resolve Extensions is aimed at adding suffixes to files according to the incoming rules without writing file suffixes when introducing dependencies.
After understanding the principle, let's take a look at the implementation of utils/tryExtensions method:
/** * * * @param {*} modulePath Module absolute path * @param {*} extensions Extension array * @param {*} originModulePath Original incoming module path * @param {*} moduleContext Module context (current module directory) */ function tryExtensions( modulePath, extensions, originModulePath, moduleContext ) { // Try the no extension option first extensions.unshift(''); for (let extension of extensions) { if (fs.existsSync(modulePath + extension)) { return modulePath + extension; } } // No matching file throw new Error( `No module, Error: Can't resolve ${originModulePath} in ${moduleContext}` ); }
This method is very simple. We use FS Existssync checks the incoming file and traverses it in combination with extensions to find out whether the corresponding matching path exists. If it is found, it returns directly. If not found, give a friendly prompt error.
Attention should be paid to extensions unshift(''); This is to prevent users from directly searching if they have passed in suffixes. If the file can be found, we will return it directly. It will try in turn when it cannot be found.
Recursive processing
After the processing in the previous step, we can call buildModule for the entry file to get such a return object.
Let's take a look at running webpack / core / index JS get the return result.
I printed the entries object after processing in buildingentrymodule. You can see, as we expected before:
- id is the module of each module relative to the following path (here we configure context:process.cwd()) as the webpack directory.
- Dependencies are the internal dependencies of the module, which has not been added here at present.
- Name is the name of the entry file to which the module belongs.
- _ Source is the compiled source code of the module.
At present_ The content in source is based on
Now let's open the src directory and add some dependencies and contents to our two entry files:
// webpack/example/entry1.js const depModule = require('./module'); console.log(depModule, 'dep'); console.log('This is entry 1 !'); // webpack/example/entry2.js const depModule = require('./module'); console.log(depModule, 'dep'); console.log('This is entry 2 !'); // webpack/example/module.js const name = '19Qingfeng'; module.exports = { name, };
At this point, let's re run webpack / core / index js
OK, so far, our compilation for entry can come to an end temporarily.
In short, in this step, we analyze and compile the entry through the ` ` method to get an object. Add this object to this Entries.
Next, let's deal with the dependent modules.
In fact, for the dependent modules, the same steps are the same:
- Check whether there are dependencies in the entry file.
- If there are dependencies, recursively call the buildModule method to compile the module. The passed in moduleName is the entry file to which the current module belongs. modulePath is the absolute path of the currently dependent module.
- Similarly, check recursion to check whether there is still dependency inside the dependent module. If so, compile the module with recursive dependency. This is a depth first process.
- Save each compiled module into this Modules.
Next, we just need to make a slight change in the handleWebpackCompiler method:
// Call webpack for module compilation handleWebpackCompiler(moduleName, modulePath) { // Calculate the relative path of the current module relative to the project startup root directory as the module ID const moduleId = './' + path.posix.relative(this.rootPath, modulePath); // Create module object const module = { id: moduleId, dependencies: new Set(), // Absolute path address of the module on which the module depends name: [moduleName], // The entry file to which the module belongs }; // Call babel to analyze our code const ast = parser.parse(this.moduleCode, { sourceType: 'module', }); // Depth first traversal syntax Tree traverse(ast, { // When a require statement is encountered CallExpression: (nodePath) => { const node = nodePath.node; if (node.callee.name === 'require') { // Obtain the relative path of the introduced module in the source code const requirePath = node.arguments[0].value; // Find the absolute path of the module. The current module path + require() corresponds to the relative path const moduleDirName = path.posix.dirname(requirePath); const absolutePath = tryExtensions( path.posix.join(moduleDirName, requirePath), this.options.resolve.extensions, moduleName, moduleDirName ); // Generate moduleId - add the module ID corresponding to the following path into the new dependent module path const moduleId = './' + path.posix.relative(this.rootPath, absolutePath); // Modify the require in the source code through babel to become__ webpack_require__ sentence node.callee = t.identifier('__webpack_require__'); // Modify the modules introduced by the require statement in the source code. All modifications are processed relative to the following path node.arguments = [t.stringLiteral(moduleId)]; // Add the dependency caused by the require statement for the current module (the content is the module ID relative to the root path) module.dependencies.add(moduleId); } }, }); // After traversal, generate new code according to AST const { code } = generator(ast); // Mount the new generated code for the current module module._source = code; // Recursive dependency depth traversal is added if there are dependent modules module.dependencies.forEach((dependency) => { const depModule = this.buildModule(moduleName, dependency); // Add any compiled dependent module objects to the modules object this.modules.add(depModule); }); // Returns the current module object return module; }
Here we add such a code:
// Recursive dependency depth traversal is added if there are dependent modules module.dependencies.forEach((dependency) => { const depModule = this.buildModule(moduleName, dependency); // Add any compiled dependent module objects to the modules object this.modules.add(depModule); });
Here, we recursively call buildModule for the dependent module, and add the output module object into this Modules.
At this point, let's re run webpack / core / index JS. Here, I print assets and modules after buildingentrymodule compilation:
Set { { id: './example/src/entry1.js', dependencies: Set { './example/src/module.js' }, name: [ 'main' ], _source: 'const depModule = __webpack_require__("./example/src/module.js");\n' + '\n' + "console.log(depModule, 'dep');\n" + "console.log('This is entry 1 !');\n" + "const loader2 = '19Qingfeng';\n" + "const loader1 = 'https://github.com/19Qingfeng';" }, { id: './example/src/entry2.js', dependencies: Set { './example/src/module.js' }, name: [ 'second' ], _source: 'const depModule = __webpack_require__("./example/src/module.js");\n' + '\n' + "console.log(depModule, 'dep');\n" + "console.log('This is entry 2 !');\n" + "const loader2 = '19Qingfeng';\n" + "const loader1 = 'https://github.com/19Qingfeng';" } } entries Set { { id: './example/src/module.js', dependencies: Set {}, name: [ 'main' ], _source: "const name = '19Qingfeng';\n" + 'module.exports = {\n' + ' name\n' + '};\n' + "const loader2 = '19Qingfeng';\n" + "const loader1 = 'https://github.com/19Qingfeng';" }, { id: './example/src/module.js', dependencies: Set {}, name: [ 'second' ], _source: "const name = '19Qingfeng';\n" + 'module.exports = {\n' + ' name\n' + '};\n' + "const loader2 = '19Qingfeng';\n" + "const loader1 = 'https://github.com/19Qingfeng';" } } modules
You can see that we have added module JS dependency has been successfully added to modules, and it has also been processed by loader. But we found it repeated twice.
This is because module JS has been referenced twice. It has been relied on by both entry1 and entry2. During recursive compilation, we have buildModule the same module twice.
Let's deal with this problem:
handleWebpackCompiler(moduleName, modulePath) { ... // Modify the require in the source code through babel to become__ webpack_require__ sentence node.callee = t.identifier('__webpack_require__'); // Modify the modules introduced by the require statement in the source code. All modifications are processed relative to the following path node.arguments = [t.stringLiteral(moduleId)]; // The array converted to ids is easy to handle const alreadyModules = Array.from(this.modules).map((i) => i.id); if (!alreadyModules.includes(moduleId)) { // Add the dependency caused by the require statement for the current module (the content is the module ID relative to the root path) module.dependencies.add(moduleId); } else { // If it already exists, it is not necessary to add it into the module compilation, but it is still necessary to update the entry that the module depends on this.modules.forEach((value) => { if (value.id === moduleId) { value.name.push(moduleName); } }); } } }, }); ... }
Here, in each dependency transformation of code analysis, first judge this Whether the module object already exists in the current module (judged by the unique module id path).
If it does not exist, add it into the dependency for compilation. If the module already exists, it proves that the module has been compiled. So at this time, we don't need to compile it again. We just need to update the chunk to which the module belongs and add the current chunk name to its name attribute.
Run it again. Let's look at the print results:
Set(2) { { id: './example/src/entry1.js', dependencies: Set(1) { './example/src/module.js' }, name: [ 'main' ], _source: 'const depModule = __webpack_require__("./example/src/module.js");\n' + '\n' + "console.log(depModule, 'dep');\n" + "console.log('This is entry 1 !');\n" + "const loader2 = '19Qingfeng';\n" + "const loader1 = 'https://github.com/19Qingfeng';" }, { id: './example/src/entry2.js', dependencies: Set(0) {}, name: [ 'second' ], _source: 'const depModule = __webpack_require__("./example/src/module.js");\n' + '\n' + "console.log(depModule, 'dep');\n" + "console.log('This is entry 2 !');\n" + "const loader2 = '19Qingfeng';\n" + "const loader1 = 'https://github.com/19Qingfeng';" } } Entry file Set(1) { { id: './example/src/module.js', dependencies: Set(0) {}, name: [ 'main', 'second' ], _source: "const name = '19Qingfeng';\n" + 'module.exports = {\n' + ' name\n' + '};\n' + "const loader2 = '19Qingfeng';\n" + "const loader1 = 'https://github.com/19Qingfeng';" } } modules
At this point, our "module compilation phase" is basically over. In this step, we analyze all modules from the entry file.
- Starting from the entry, read the contents of the entry file and call the matching loader to process the entry file.
- Analyze the dependencies through babel, and replace all dependent paths with options relative to the project startup directory at the same time Path of context.
- If there are dependencies in the entry file, recursively compile the dependent modules in the above steps.
- Add the compiled object of each dependent module to this modules.
- Add the compiled object of each entry file to this entries.
Compilation completion phase
In the previous step, we completed the compilation between modules and filled the contents of module and entry respectively.
After the recursive compilation of all modules, we need to combine the final output chunk module according to the above dependencies.
Let's continue to transform our Compiler:
class Compiler { // ... buildEntryModule(entry) { Object.keys(entry).forEach((entryName) => { const entryPath = entry[entryName]; // Call buildModule to realize the real module compilation logic const entryObj = this.buildModule(entryName, entryPath); this.entries.add(entryObj); // According to the interdependence between the current entry file and the module, it is assembled into chunk s containing all the dependent modules of the current entry this.buildUpChunk(entryName, entryObj); }); console.log(this.chunks, 'chunks'); } // Assemble chunks according to the entry file and dependent modules buildUpChunk(entryName, entryObj) { const chunk = { name: entryName, // Each entry file acts as a chunk entryModule: entryObj, // entry compiled object modules: Array.from(this.modules).filter((i) => i.name.includes(entryName) ), // Find all module s related to the current entry }; // Add chunk to this Chunks this.chunks.add(chunk); } // ... }
Here, we find all the dependent files of the corresponding entry through the name attribute of each module according to the corresponding entry file.
Let's take a look at this first What will chunks eventually output:
Set { { name: 'main', entryModule: { id: './example/src/entry1.js', dependencies: [Set], name: [Array], _source: 'const depModule = __webpack_require__("./example/src/module.js");\n' + '\n' + "console.log(depModule, 'dep');\n" + "console.log('This is entry 1 !');\n" + "const loader2 = '19Qingfeng';\n" + "const loader1 = 'https://github.com/19Qingfeng';" }, modules: [ [Object] ] }, { name: 'second', entryModule: { id: './example/src/entry2.js', dependencies: Set {}, name: [Array], _source: 'const depModule = __webpack_require__("./example/src/module.js");\n' + '\n' + "console.log(depModule, 'dep');\n" + "console.log('This is entry 2 !');\n" + "const loader2 = '19Qingfeng';\n" + "const loader1 = 'https://github.com/19Qingfeng';" }, modules: [] } }
In this step, * * we get two chunk s * * in the final output of Webpack.
They each have:
- Name: name of the current entry file
- entryModule: the compiled object of the entry file.
- modules: an array of all module objects that the entry file depends on. The format of each element is consistent with that of entryModule.
At this time, the compilation is completed, and the link of assembling chunk is successfully completed.
Output file phase
Let's first put this assembled after all the compilation in the previous step chunks.
Analyze the original packaging output
Here, I put webpack / core / index JS has been modified as follows:
- const webpack = require('./webpack'); + const webpack = require('webpack') ...
Use the original webpack instead of our own webpack to pack it first.
Run webpack / core / index JS, we will get two files in webpack/src/build: main JS and second JS, we use one of the main JS to see its content:
(() => { var __webpack_modules__ = { './example/src/module.js': (module) => { const name = '19Qingfeng'; module.exports = { name, }; const loader2 = '19Qingfeng'; const loader1 = 'https://github.com/19Qingfeng'; }, }; // The module cache var __webpack_module_cache__ = {}; // The require function function __webpack_require__(moduleId) { // Check if module is in cache var cachedModule = __webpack_module_cache__[moduleId]; if (cachedModule !== undefined) { return cachedModule.exports; } // Create a new module (and put it into the cache) var module = (__webpack_module_cache__[moduleId] = { // no module.id needed // no module.loaded needed exports: {}, }); // Execute the module function __webpack_modules__[moduleId](module, module.exports, __webpack_require__); // Return the exports of the module return module.exports; } var __webpack_exports__ = {}; // This entry need to be wrapped in an IIFE because it need to be isolated against other modules in the chunk. (() => { const depModule = __webpack_require__( /*! ./module */ './example/src/module.js' ); console.log(depModule, 'dep'); console.log('This is entry 1 !'); const loader2 = '19Qingfeng'; const loader1 = 'https://github.com/19Qingfeng'; })(); })();
Here I manually delete the redundant comments generated after packaging and simplify the code.
Let's analyze the code generated by the original package:
The packaged code of webpack defines a__ webpack_require__ The function of replaces the require method inside NodeJs.
At the same time, the bottom
This code is familiar to everyone. This is our compiled entry file code. At the same time, the code at the top is an object defined by all modules that the entry file depends on:
Here is a definition__ webpack__ For the object of modules, * * the key of the object is the relative path of the dependent module relative to the following path, and the value of the object is the compiled code of the dependent module`
Output file phase
Next, after analyzing the original packaged code of webpack, let's continue with the previous step. Through our this Chunks to try to output the final effect.
Let's go back to the run method on the Compiler:
class Compiler { } // The run method starts compilation // At the same time, the run method accepts the callback passed externally run(callback) { // When the run mode is called, the plugin that starts compiling is triggered this.hooks.run.call(); // Get portal configuration object const entry = this.getEntry(); // Compile entry file this.buildEntryModule(entry); // Export list; Each chunk is then converted into a separate file and added to the output list assets this.exportFile(callback); }
After the buildingentrymodule module is compiled, we use this The exportfile method implements the logic of exporting files.
Let's take a look at this Exportfile method:
// Add chunk to the output list exportFile(callback) { const output = this.options.output; // Generate assets content according to chunks this.chunks.forEach((chunk) => { const parseFileName = output.filename.replace('[name]', chunk.name); // In assets {'main.js':' generated string code... '} this.assets[parseFileName] = getSourceCode(chunk); }); // Call Plugin emit hook this.hooks.emit.call(); // First judge whether the directory has direct FS If write does not exist, create it first if (!fs.existsSync(output.path)) { fs.mkdirSync(output.path); } // Save all generated file names in files this.files = Object.keys(this.assets); // Generate a packaged file from the contents of assets and write it to the file system Object.keys(this.assets).forEach((fileName) => { const filePath = path.join(output.path, fileName); fs.writeFileSync(filePath, this.assets[fileName]); }); // Trigger hook after completion this.hooks.done.call(); callback(null, { toJson: () => { return { entries: this.entries, modules: this.modules, files: this.files, chunks: this.chunks, assets: this.assets, }; }, }); }
exportFile does the following:
- First, get the output configuration of the configuration parameters and iterate our this Chunks, output The [name] in filename is replaced with the corresponding entry file name. At the same time, according to the contents of chunks, it is this Add the file name and file content that need to be packaged in assets.
- Call the emit hook function of plugin before writing the file to disk.
- Judge output Whether the path folder exists. If it does not exist, create a new folder through fs.
- Store all the file names generated by this packaging (the array composed of the key value of this.assets) into files.
- Loop this Assets, write the files to the corresponding disk in turn.
- When all packaging processes are completed, the done hook of the webpack plug-in is triggered.
- At the same time, echo the NodeJs Webpack APi and call the run method to pass in two parameters for the externally passed callback.
In general, this What assets does is also relatively simple, that is, get assets by analyzing chunks, and then output the corresponding code to disk.
Take a closer look at the code above and you'll find. this. The value of each element in the assets Map is generated by calling the getSourceCode(chunk) method to generate the code corresponding to the module.
So how does getSourceCode generate our final compiled code according to chunk? Let's have a look!
getSourceCode method
First, let's briefly clarify the responsibilities of this method. We need the getSourceCode method to accept the incoming chunk object. This returns the source code of the chunk.
No more nonsense. In fact, I used a lazy method here, but it doesn't prevent you from understanding the webpack process. Above, we analyzed the original webpack packaged code, only the entry file and module dependency are different in each packaging, and the require method is the same.
To grasp the differences of each time, let's take a look at its implementation:
// webpack/utils/index.js ... /** * * * @param {*} chunk * name Property entry file name * entryModule Entry file module object * modules Dependent module path */ function getSourceCode(chunk) { const { name, entryModule, modules } = chunk; return ` (() => { var __webpack_modules__ = { ${modules .map((module) => { return ` '${module.id}': (module) => { ${module._source} } `; }) .join(',')} }; // The module cache var __webpack_module_cache__ = {}; // The require function function __webpack_require__(moduleId) { // Check if module is in cache var cachedModule = __webpack_module_cache__[moduleId]; if (cachedModule !== undefined) { return cachedModule.exports; } // Create a new module (and put it into the cache) var module = (__webpack_module_cache__[moduleId] = { // no module.id needed // no module.loaded needed exports: {}, }); // Execute the module function __webpack_modules__[moduleId](module, module.exports, __webpack_require__); // Return the exports of the module return module.exports; } var __webpack_exports__ = {}; // This entry need to be wrapped in an IIFE because it need to be isolated against other modules in the chunk. (() => { ${entryModule._source} })(); })(); `; } ...
This code is actually very, very simple, far less difficult than you think! It's a little back to nature, isn't it? Ha ha.
In the getSourceCode method, we get the corresponding by combining the chunk s:
- Name: the name of the output file corresponding to the entry file.
- entryModule: store the compiled object of the entry file.
- Modules: the object that stores all modules that the entry file depends on.
We realized it by string splicing__ webpack__ The attributes on the modules object are also spliced with the code of the entry / exit file through ${entryModule._source} at the bottom.
Here, we mentioned above why we need to convert the path of the module's require method into a path relative to the context. I believe we all know why to do so. Because what we finally achieved__ webpack_require__ Methods are all required methods implemented for the relative path between the module and the path.
At the same time, if it is not clear how the require method changes, it is called__ webpack_require__ Method students can go back to our compilation chapter and review it carefully ~ we turn the require method call into__ webpack_require__.
be accomplished
So far, let's go back to webpack / core / index JS. Re run this file, and you will find an additional build directory in the webpack/example directory.
In this step, we will perfectly realize our own webpack.
In fact, for the implementation of a simple version of the webpack core, I still hope you can fully understand the compiler object while understanding its workflow.
In any subsequent underlying development related to webpack, we should really know the usage of compiler. Understand how various attributes on the compiler affect the compilation and packaging results.
Let's make a perfect ending with a flow chart:
Write at the end
First of all, thank everyone who can see here.
This article has a certain knowledge threshold, and most of the code part. I admire everyone who can read to the end.
The article will come to an end here for the implementation of a simple version of webpack. In fact, this is just the most basic version of webpack workflow.
But it is through such a small 🌰 We can really get started with the core workflow of webpack. I hope this article can play a better auxiliary role in understanding webpack.
In fact, after a clear understanding of the basic workflow, the development of loader and plugin is handy. The development introduction of these two parts in this article is relatively superficial. Later, I will update the detailed development process of loader and plugin respectively. Interested students can pay attention in time 😄.
The code in the article can be found in Download here , this simple version of webpack, I will continue to improve the logical processing of more workflows in the code base.
At the same time, the code here I want to emphasize is the explanation of the source code process. The real webpack will be much more complex than here. In order to facilitate your understanding, it has been deliberately simplified, but the core workflow is basically consistent with the source code.