Add C++ modules support #2291

mikomikotaishi · 2025-12-08T04:18:58Z

This pull request adds support for C++20 modules through CMake. It is enabled by the HTTPLIB_BUILD_MODULES option (which requires HTTPLIB_COMPILE to be enabled, though it probably doesn't have to - I only forced this requirement because it seems to make the most sense to force the library to compile if modules are to be compiled).

Spixmaster · 2025-12-08T09:06:56Z

Great work. I also thought about this.

The approach is very similiar to what was done to https://github.com/nlohmann/json with nlohmann/json#4799. Then, I noticed ...

Is this temporary approach with a file with using ... for the meantime while modules and no modules are supported?

This approach is not using modules natively but rather as an interface to the original way.

Does this method work without disadvantages?

mikomikotaishi · 2025-12-08T09:24:04Z

This is the best way to my knowledge to support modules on top of a header-only or header/source library, allowing continued support for older versions while providing newer features as an option.

I'm not aware of any disadvantages to it besides a being additional translation unit to compile, but if I am wrong please correct me. The only glaring difference in API is that detail symbols are hidden as they are not exported, but in my opinion that's probably better not to expose detail symbols and flood IDE suggestions with implementation details.

Spixmaster · 2025-12-08T10:40:22Z

What about compiled libraries? Is it possible to have the traditional method and modules installed in parallel?

I am thinking of repositories that ship compiled .so or .a.

This is relevant here, https://aur.archlinux.org/packages/cpp-httplib-compiled .

mikomikotaishi · 2025-12-08T10:42:57Z

Yes I believe it's possible to use shared/static libraries with modules, all of my modular projects compiled to shared libraries that an executable consumes

yhirose · 2025-12-08T23:00:52Z

@mikomikotaishi, thanks for the fine pull request! It's fantastic, but my concern is that someone needs to update modules/httplib.cppm whenever new external symbols are added or existing symbols are removed/renamed, and it could happen quite often. I am not planning to maintain this file. So if you cannot keep maintaining the file, I am probably not merge it since the file can be easily out-of-date. Is there a way to generate modules/httplib.cppm from httplib.h automatically?

@sum01 @jimmy-park @Tachi107 do you have any thought about this pull request?

mikomikotaishi · 2025-12-08T23:31:22Z

I could create a Python script, or some other kinds of automated means of updating, which you could run every time it is updated. Until then I would be OK with maintaining this file, as it is a simple process. Such a script would probably comb through the file and add any symbols not part of a detail or internal namespace, or prefixed with an underscore, etc.

However, I am curious why it is not feasible to update the file manually. In case it isn't clear how, one can update the file by adding a using httplib::NEW_SYMBOL_HERE; into the export namespace httlib { } block, for each new symbol that is added. Do let me know if any more information is needed.

mikomikotaishi · 2025-12-08T23:34:20Z

I have also seen some repositories use bots to push some commits too. Potentially one such bot could be set up to automatically populate the module with new changes each time there is a mismatch. I don't know anything about how to set this up, but I have seen this before and it could potentially be a solution (but I think the simplest one is just to run a Python script each time any update to the library happens).

mikomikotaishi · 2025-12-08T23:59:37Z

Anyway, I think this could be one such way of automatically updating the module.

yhirose · 2025-12-09T00:54:47Z

@mikomikotaishi thanks for the additional explanation. I am ok with the following your suggestion.

I have also seen some repositories use bots to push some commits too. Potentially one such bot could be set up to automatically populate the module with new changes each time there is a mismatch. I don't know anything about how to set this up, but I have seen this before and it could potentially be a solution.

We could automatically generate modules/httplib.cppm in a GitHub Actions script when httplib.h is pushed. (I can't accept the way to run a script to update the file manually, since I don't maintain any build systems in this repository. Please see #955 (comment).)

mikomikotaishi · 2025-12-09T01:26:46Z

OK, that makes sense to me. (I don't know anything about how to run GitHub Actions or write scripts for it however, so I'm afraid the most I can do is create a script for this.)

mikomikotaishi · 2025-12-09T02:09:18Z

I'm not sure why there were failing workflows as I didn't change anything in the core library

mikomikotaishi · 2025-12-09T02:17:26Z

Never mind, it seems the failing CI is happening upstream too.

split.py

update_modules.py

yhirose · 2025-12-09T02:42:25Z

.github/workflows/update-module-exports.yml

Thanks for adding this GitHub Action. But after further consideration, I now think this process should be done in CMakeLists.txt. You could generate modules/httplib.cppm under this line

cpp-httplib/CMakeLists.txt

Line 216 in 87c2b4e

if(HTTPLIB_COMPILE)

We actually do the similar to generate httplib.h and httplib.cc with split.py and put them in a distribution package. By doing this, we no longer need to keep modules/httplib.cppm in this repository. It will be created on the fly only when necessary.

@sum01 @jimmy-park Is my comment above correct?

I think it would be difficult to parse an especially large file without syntax analysis or parsing library. I don't have a lot of experience with this sort of thing, and because the header contains things like method definitions outside of the class it seems to be a very complex task. Then including a C++ parser would require a dependency even for a source generation script which would be additional bloat.

I guess this could be done by attaching to the header file that split.py generates, but I haven't actually seen what that looks like.

As for dynamically generating the module, this was raised before on the Dear ImGui library which currently does something like this. I think this is a bad choice for module API (having a static file is obviously best for consumers to be able to just read it from a distance without additional interaction, i.e. a "what-you-see-is-what-you-get" sort of API), but ultimately as you are in charge I'll look into this if you prefer it.

I'm looking more into this and I think if we want to allow CMake to compile the module if it's generated into an "out" directory, we have to force the out directory to be named "out" so that it can be written into the CMakeLists.txt script

Anyway @yhirose do you have any opinion on the output directory having a hard-coded name?

@sum01 @jimmy-park @Tachi107 could you please answer this question?

Tachi107 · 2025-12-09T16:43:53Z

If the file can be run during the build process (and if the output consists of machine-generated files, it should *only* run during build time), then the destination directory should be configurable (maybe defaulting to the the current working directory). Even better, if the output is a single file, the script should allow the user to specify the output file itself (full path). This is because downstreams (like Debian, which is what I maintain the meson build scripts for) may have some requirements on where build products should be stored.

mikomikotaishi · 2025-12-09T18:05:29Z

@Tachi107 CMake needs to know what the output directory is ahead of time to compile the module. How do you propose to solve this?

mikomikotaishi · 2025-12-11T07:28:19Z

@yhirose I think there is one possible solution to allow both the directory to be user-specified while still supporting CMake module building, which is probably just to have the Python script generate the CMake file too. I don't know if this is too convoluted or awkward of a design though, so please do tell me your thoughts.

Tachi107 · 2025-12-11T11:02:12Z

Hi Miko,

On Tue Dec 9, 2025 at 10:46 PM CET, Miko wrote: @Tachi107 CMake needs to know what the output directory is ahead of time to compile the module. How do you propose to solve this?

You should use CMake's add_custom_command() function to invoke the script and pass it the output file path https://cmake.org/cmake/help/latest/command/add_custom_command.html Meson has a similar function, but I can do that myself after this gets merged.

Tachi107

Some feedback now that I have been able to look at the whole code :)

Tachi107 · 2025-12-11T14:07:10Z

modules/CMakeLists.txt

+target_sources(httplib_module
+    PUBLIC
+        FILE_SET CXX_MODULES FILES
+            httplib.cppm
+)


This requires CMake 3.28, but this project only requires CMake 3.0.

C++ modules require a later version of CMake, so activating modules should require the user has the correct version.

Tachi107 · 2025-12-11T14:07:40Z

CMakeLists.txt

 option(HTTPLIB_USE_NON_BLOCKING_GETADDRINFO "Enables the non-blocking alternatives for getaddrinfo." ON)
 option(HTTPLIB_REQUIRE_ZSTD "Requires ZSTD to be found & linked, or fails build." OFF)
 option(HTTPLIB_USE_ZSTD_IF_AVAILABLE "Uses ZSTD (if available) to enable zstd support." ON)
+option(HTTPLIB_BUILD_MODULES "Build httplib modules (requires HTTPLIB_COMPILE to be ON)." OFF)


Why would this need to be an option? Does it add any other dependency? Cannot it be always be built, without an option?

I want to leave it as opt-in because C++ module support is compiler dependent. Most newer compiler versions offer it, but older ones do not. The build could break if we force it on older toolchains.

Tachi107 · 2025-12-11T14:13:14Z

update_modules.py

Which Python version does this script require? Is there a reason why, for example, Optional[str] is used instead of the built-in str | None introduced by PEP 604?

I am just used to writing Optional[T], if T | None is better I can replace it

Anyway I believe this requires a minimum of Python 3.5 now

Tachi107 · 2025-12-11T14:29:20Z

update_modules.py

+def get_git_diff(file_path: str, base_ref: str = "HEAD") -> Optional[str]:
+    """
+    Get the git diff for a specific file.
+    
+    @param file_path Path to the file to diff
+    @param base_ref Git reference to compare against (default: HEAD)
+    @return The git diff output, or None if error
+    """
+    try:
+        result: CompletedProcess = subprocess.run(
+            ["git", "diff", base_ref, "--", file_path],
+            capture_output=True,
+            text=True,
+            check=True
+        )
+        return result.stdout
+    except CalledProcessError as e:
+        print(f"Error getting git diff: {e}", file=sys.stderr)
+        return None


Requiring a Git repository to built the modules is something many users would prefer to avoid. For example, users building from the release tarballs produced by GitHub will not have any Git history, and this would not work.

A way to resolve this would be to always generate the modules file from scratch, so that no diffing operations are required.

This was the suggestion @yhirose gave, and while I personally would prefer to have a static module for the purpose of having an obvious or clear module API (rather than behind a script) this is the approach I'm taking because the author does not maintain the module file or build system scripts.

That being said, this Python script is only supposed to be run by a GitHub Action script, so it does rely on the assumption of Git setup.

@mikomikotaishi you probably misunderstood what I mentioned in the following comment. I said exactly the same thing as @Tachi107 mentioned above: "Could you please simply regenerate httplib.cppm from scratch with the latest httplib.h? It will make this script much simpler".
#2291 (comment)

Yes, that is what I was saying, we would be generating it on demand.

There are some problems I am having with implementing this, for example having to insert the #ifdefs to add to the generated module all of the SSL classes. Do you have any suggestions for this?

Sorry for the confusion... What I meant by 'from scratch' was to create httplib.cppm entirely, rather than just updating a portion of it. So we would like you not to use get_git_diff.

Yes, I meant that I would be regenerating it entirely. My only problem is how to generate the symbols with the aforementioned.

mikomikotaishi added 3 commits December 7, 2025 23:16

Add C++ modules support

41eb63f

Add module examples

8bda27b

Missing semicolon

9e4ca69

mikomikotaishi added 3 commits December 8, 2025 20:57

Update GitHub Actions script and create a modules updating script

6eeb13a

Name the unused param

36b1c16

Merge branch 'yhirose:master' into master

97fa023

yhirose reviewed Dec 9, 2025

View reviewed changes

split.py Show resolved Hide resolved

yhirose reviewed Dec 9, 2025

View reviewed changes

update_modules.py Show resolved Hide resolved

yhirose reviewed Dec 9, 2025

View reviewed changes

Merge branch 'master' into master

6655639

mikomikotaishi mentioned this pull request Dec 9, 2025

Update split.py file #2295

Merged

Merge branch 'yhirose:master' into master

66a0864

Tachi107 suggested changes Dec 11, 2025

View reviewed changes

Merge branch 'yhirose:master' into master

e81bc86

Add C++ modules support #2291

Are you sure you want to change the base?

Add C++ modules support #2291

Conversation

mikomikotaishi commented Dec 8, 2025

Uh oh!

Spixmaster commented Dec 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mikomikotaishi commented Dec 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Spixmaster commented Dec 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mikomikotaishi commented Dec 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

yhirose commented Dec 8, 2025

Uh oh!

mikomikotaishi commented Dec 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mikomikotaishi commented Dec 8, 2025

Uh oh!

mikomikotaishi commented Dec 8, 2025

Uh oh!

yhirose commented Dec 9, 2025

Uh oh!

mikomikotaishi commented Dec 9, 2025

Uh oh!

mikomikotaishi commented Dec 9, 2025

Uh oh!

mikomikotaishi commented Dec 9, 2025

Uh oh!

Uh oh!

Uh oh!

yhirose Dec 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Tachi107 commented Dec 9, 2025 via email

Uh oh!

mikomikotaishi commented Dec 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mikomikotaishi commented Dec 11, 2025

Uh oh!

Tachi107 commented Dec 11, 2025 via email

Uh oh!

Tachi107 left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

mikomikotaishi Dec 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Spixmaster commented Dec 8, 2025 •

edited

Loading

mikomikotaishi commented Dec 8, 2025 •

edited

Loading

Spixmaster commented Dec 8, 2025 •

edited

Loading

mikomikotaishi commented Dec 8, 2025 •

edited

Loading

mikomikotaishi commented Dec 8, 2025 •

edited

Loading

yhirose Dec 9, 2025 •

edited

Loading

mikomikotaishi commented Dec 9, 2025 •

edited

Loading

mikomikotaishi Dec 11, 2025 •

edited

Loading