Tectonic: A modernized, complete, self-contained TeX/LaTeX engine

maxloh • 5 days ago • 43 comments • Article on tectonic-typesetting.github.io

Discussion (43 Comments)

kxhbr•2 days ago

Co-maintainer here.

Tectonic is a cool project, but hasn't seen any significant changes in a few years---and likely won't anytime soon. It seems we maintainers don't have the time and motivation to put serious work into Tectonic.

I haven't looked at the code in years (and thus may be wrong), but here's a quick overview:

Tectonic's code consists of thin bindings to /harfbuzz/graphite/etc and a vendored XeTeX (in C, with some tweaks to make the build easier), driven by Rust that tries to keep the TeX environment predictable and sane. A few components have been fully ported to Rust (bibtex, spx2html), but the project is very unfinished.

I've looked into the dark corners of TeX when I worked on Tectonic, and it is not pretty. TeX relies on a stack of evil hacks and esoteric behavior that is very hard to replicate, and very difficult to expose in an ergonomic way. This is true of the core system, and of many packages on CTAN.

A quick example: code highlighting does not work in Tectonic. The canonical solution is https://ctan.org/pkg/minted, which spawns a python process to style your code. Reproducibility is one of Tectonic's selling points, so we cannot replicate this behavior.
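
To make the problem concrete, a minimal minted document is sketched below. The file content is purely illustrative and not taken from Tectonic itself; the point is that the highlighting is produced by an external Pygments (Python) process, which traditionally means compiling with shell escape enabled.

```latex
% Traditionally compiled with: pdflatex -shell-escape example.tex
% Requires Python and Pygments to be installed alongside TeX.
\documentclass{article}
\usepackage{minted}
\begin{document}
% The body of this environment is handed to Pygments for styling.
\begin{minted}{python}
print("hello")
\end{minted}
\end{document}
```

The output therefore depends on whichever Python and Pygments versions happen to be installed, which is at odds with a reproducible build.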

With https://typst.app/ as good as it is, there's little motivation to modernize TeX---especially considering the effort required. Typst _is_ modern TeX, and I'd rather spend my time there.

GiovanniP•2 days ago

While Typst appears to be popular, I think that TeXmacs, https://www.texmacs.org/, which is a program independent from both TeX and Emacs, is the kind of program that we need for writing: a fully WYSIWYG, fully structured document preparation system, in which you edit the structure of your document in a WYSIWYG way. When editing the structure on-screen, the user has no need to be aware that they are doing so, as it looks like they are editing a text document; at the same time, the TeXmacs editor will guide the user toward keeping a structured document.

xvilka•2 days ago

IIRC TeXmacs supports only a quite limited subset of what LaTeX and TeX can do. Just like LyX, it can create new documents but will often fail to open ones that were created outside of it.

GiovanniP•2 days ago

I think it is so. As far as I know, there are no converters that can do that. A search with an LLM made me find https://arxiv.org/pdf/2605.16562, a paper describing the ArXiv conversion tool from LaTeX to HTML; here is a sentence from the abstract:

"corpus-scale conversion work aimed at 90% error-free HTML (currently 75%)"

although there may be issues that I do not understand or did not see (I looked at the paper very quickly) that make it more difficult for the authors than for the simplest possible translation.

leephillips•1 day ago

I disagree. Many people abhor WYSIWYG programs, myself included.

Having tried both TeXmacs and Typst¹, it’s easy for me to understand why Typst is rapidly gaining adoption and why, after — how long, over a decade? — essentially nobody uses TeXmacs.

[1] https://lwn.net/Articles/1037577/

IshKebab•1 day ago

I haven't tried TeXmacs (thanks to the truly abysmal name I assumed it was something to do with Emacs), but I have used LyX and it changed my opinion on WYSIWYG. It isn't a fundamentally flawed idea, it's just that most implementations of it are.

meling•2 days ago

I’ve been using Typst lately and it has been great. I’ve made an exam template for my university and made an export feature so that I could generate the exam in the json format that our online exam system (WISEflow) expects, with support for multiple choice and essay style questions.

It is so snappy and with great error messages. I encourage people to try it out. The typst tutorial is very approachable.

xvilka•2 days ago

I should note, it's still not on par feature-wise compared to TeX ecosystem, but it gets there with incredible speed. As for UX - it beats anything TeX-based ten times over.

__mharrison__•2 days ago

Typst is awesome. I've completely migrated and am happy to never have to touch LaTeX again. (Or have an AI write it.)

thomasfromcdnjs•2 days ago

Every time I get Claude to write a PDF, it uses Tectonic.

BoingBoomTschak•2 days ago

I haven't used typesetting since my university days and updating my ModernCV CV, but it's also worth mentioning SILE (https://github.com/sile-typesetter/sile) for modern typesetting with minimal cruft.

I also found the very useful comparison website https://polytype.dev/ on the subject!

maxloh•5 days ago

Background of the project:

Tectonic is forked from the XeTeX TeX engine. However, the build process for XeTeX (and all other mainstream TeX engines) is extremely baroque:

1. The original Knuth sources are written in a language called WEB (which is basically used by nobody else)

2. Those sources are then patched with a series of “change files” due to restrictions on distributing modified sources

3. These patched files are then converted to the Pascal language using some custom processing scripts

4. That Pascal code is then converted to C code using additional custom tools

5. Those C files are then compiled against a set of extension files and libraries written natively in C and C++

One of the big motivators for the launch of the Tectonic project was to break out of this ridiculously unwelcoming development process. Tectonic is based on the C/C++ files that emerge from the XeTeX build process, and is gradually translating that code to modern Rust.

While the core TeX architecture and XeTeX are largely stable, they do, however, evolve with time. This repository recreates the final XeTeX C/C++ files from their source, so that modifications can be ported into the main Tectonic codebase.

https://github.com/tectonic-typesetting/tectonic-staging/blo...

gucci-on-fleek•2 days ago

> However, the build process for XeTeX (and all other mainstream TeX engines) is extremely baroque

(Disclaimer: I'm on the TeX Live team)

Yes, the build process is rather unusual internally, but this is pretty well insulated from the user. The "standard" build commands used by lots of other open source projects

  git clone […] && ./configure && make

should mostly just work, and give you a functioning TeX engine at the end. The full steps are listed at [0] if you're interested in more details though. And TeX Live is regularly built on essentially every platform imaginable [1], so it's pretty unlikely that you'll ever need to work with the low-level build system.

And only some of the engines require this complex build process; LuaTeX is the recommended engine these days [2], and it's written entirely in C (with a few C++ libraries), and it uses a standard autotools build process [3].

> One of the big motivators for the launch of the Tectonic project was to break out of this ridiculously unwelcoming development process.

TeX development is pretty welcoming in my personal experience: the first time that I built TL (~5 years ago), I was able to follow the official instructions without any problems, and I got a working TeX engine by the end of it. And it was only 2 or 3 years after that that I became an "official" member of the TL team, so I like to think that the development process is fairly welcoming.

But if you have any suggestions on what we can do better, please let me know, either by replying to this comment or to the email address linked in my profile.

> Tectonic is based on the C/C++ files that emerge from the XeTeX build process

I haven't looked at the Tectonic source, but the problem is that the C code generated by Web2C is fairly unreadable, so it's not really very usable as source code.

> While the core TeX architecture and XeTeX are largely stable, they do, however, evolve with time.

XeTeX is essentially frozen for the time being [4], unless someone steps up to maintain it. We are cautiously optimistic regarding LLMs though: they seem to be fairly decent at writing WEB code, and since even the TL development team only has 2 developers who are proficient in WEB, we'll take any help that we can get. (We don't have any non-trivial AI-written code yet, but it's definitely something that we're looking into)

[0]: https://tug.org/texlive/build.html

[1]: https://ftp.math.utah.edu/pub/texlive-utah/

[2]: https://www.latex-project.org/news/2024/11/01/issue40-of-lat...

[3]: https://gitlab.lisn.upsaclay.fr/texlive/luatex

[4]: https://tex.stackexchange.com/a/755239/270600

ks2048•2 days ago

So what does this fork do? No more WEB? No more Pascal? Does it produce exact results as XeTeX?

eigenblake•1 day ago

Other solutions get a lot right, but I don't know if I will ever use something other than LaTeX. Nothing gets math typesetting feeling so right. The justification and bin packing feels right. I might be esoteric for it. Maybe I care too much about things looking excellent. Maybe Claude is just good enough at debugging anything I'm facing. But I'm not interested in anything realtime/live if it means giving up on output quality. Overleaf has a live preview mode right there. I think a lot of the "problems with latex" are actually just problems with people developing packages in isolation in a decentralized manner. I don't care for LaTeX the language, or its implementation, but I do care for the output a whole lot, in a way that not the font alone or any approximation can reach. Maybe it's just inside baseball and people don't care about advanced typography. If I'm working on a piece of research that will carry my name on it, waiting for that compile is worth it. I still remember Scholarly Markdown. I might genuinely one day implement some other transpiler so I can always do my last pass in LaTeX. As long as your thing emits some decent IR, I'm happy.

xvilka•2 days ago

I have been using TeX/LaTeX for ages, and the same issues hinder the user experience today as they did decades ago: cryptic error messages, complex pipeline, lack of the proper Unicode symbols support out of the box, and so on.

Nowadays, with Typst existing, it's vital for TeX ecosystem to solve these issues, since none of them are present in Typst. Projects like Tectonic would solve this for TeX, but they lack enough hands and (maybe) financial support.

Otherwise, using TeX only makes sense nowadays only if 1) you already have some templates 2) some features are still missing in Typst 3) you are just forced to use TeX/LaTeX for whatever reason.

gucci-on-fleek•2 days ago

(Disclaimer: I'm on the TeX Live team)

> cryptic error messages

These have somewhat improved recently, but I agree that they're still not great.

> complex pipeline

You can typically just run "latexmk --lualatex <filename.tex>" and your document should compile in a single step.

> lack of the proper Unicode symbols support out of the box

UTF-8 has been the default input encoding since 2018 [0], so character input should mostly just work. Using complex scripts (Arabic, Devanagari, etc.) requires XeLaTeX or LuaLaTeX, but LuaLaTeX is recommended for most documents anyways [1].

Now, you still won't be able to typeset arbitrary characters without any additional setup, but this is because there is no single font that contains all characters, and since mismatched fallback fonts usually look bad, the (La)TeX developers do not want this to be the default. But

  \usepackage{fontspec}
  \setmainfont{Some Font with your Characters}

should be all that you need in most cases.
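
Put together as a complete file, that might look like the following sketch. The font name "Noto Serif" is only an assumption standing in for whatever installed font covers your characters, and the file has to be compiled with LuaLaTeX or XeLaTeX, since fontspec does not support pdfLaTeX.

```latex
% Compile with: lualatex example.tex   (or xelatex)
\documentclass{article}
\usepackage{fontspec}
% Placeholder font name (an assumption): pick any installed font
% that contains the characters you need.
\setmainfont{Noto Serif}
\begin{document}
Grüße, Ελληνικά, кириллица.
\end{document}
```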

> Projects like Tectonic would solve this for TeX

All of these have already been fixed in TeX, except for the error messages, which would be impossible for Tectonic to fix.

(Background: the TeX engines give excellent error messages, and LaTeX gives good error messages for "expected" errors, but unexpected errors usually give a TeX engine error message unrelated to your LaTeX input, since LaTeX is internally implemented on top of TeX engine macros. So much like C++ template errors, it isn't really possible to fix this.)

> Otherwise, using TeX only makes sense nowadays only if

4) If you actually like TeX for some reason like I do :)

[0]: https://www.latex-project.org/news/latex2e-news/ltnews28.pdf...

[1]: https://www.latex-project.org/news/2024/11/01/issue40-of-lat...

fsh•2 days ago

I recently had a document break because I used umlauts together with the subfigs package. Apparently both use " characters internally and clash badly. This is not a particularly exotic use case.

gucci-on-fleek•2 days ago

Yeah, unfortunately lots of the third-party LaTeX packages are fairly poorly written. Which also applies to most other programming languages, but LaTeX is somewhat unique here since (1) approximately nobody makes money off of TeX, so even "important" packages are often volunteer-maintained, and (2) LaTeX2e has been around since the early 90s, so some of the popular packages have been unmaintained for over 20 years.

The core/official LaTeX code is really quite stable, but it's also very limited, so it's pretty reasonable to conflate it with the LaTeX ecosystem as a whole. But yes, the LaTeX Team is definitely aware of the problems caused by Babel shorthands (which is what " is in German), and they're trying to figure out some way to fix it without breaking other documents.

CJefferson•2 days ago

I'd much prefer bad looking unicode characters to just skipping them and having broken output honestly. This is currently the main reason I'm considering moving from LaTeX to typst -- just always having broken output as I use more unicode (maths symbols, emojis) in my documents.

gucci-on-fleek•2 days ago

> I'd much prefer bad looking unicode characters to just skipping them

That is an option that you can enable. If you're only using a few different languages (which is by far the most typical case), it's mostly automatic with Babel (by using "\babelfont"), but if you really want to support everything, you can use something like [0] (which is admittedly rather convoluted).
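
As a rough sketch of the Babel route for a document with one secondary language: the example below assumes LuaLaTeX or XeLaTeX, and both font names are assumptions that should be replaced with fonts actually installed on your system.

```latex
% Compile with: lualatex example.tex   (or xelatex)
\documentclass{article}
\usepackage[english]{babel}
% Load Greek from babel's locale files so it can be used for short passages.
\babelprovide[import]{greek}
% Font names are assumptions; substitute fonts available on your system.
\babelfont{rm}{Latin Modern Roman}   % default roman font
\babelfont[greek]{rm}{GFS Didot}     % roman font used only for Greek
\begin{document}
English text, then \foreignlanguage{greek}{Ελληνικά} in the Greek font.
\end{document}
```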

> just always having broken output as I use more unicode (maths symbols, emojis) in my documents.

Adding "\tracinglostchars=3" anywhere in your document (before the offending character) will turn any missing characters into an error message, and using the "--halt-on-error" command-line flag will make that error fatal. I've talked to the LaTeX Team and they'd really like it if they could make those the default, but they have 40 years of backwards compatibility to deal with. I've also talked to the Overleaf support team, and it's depressingly common for documents to compile with 50+ errors and for users to think that that's completely okay.
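
A minimal sketch of that setup is shown below, assuming a reasonably recent TeX Live and the default font, which is not expected to contain the snowman character. The file name in the comment is hypothetical.

```latex
% Compile with: lualatex --halt-on-error example.tex
\documentclass{article}
% Turn missing glyphs into errors instead of silently dropping them.
\tracinglostchars=3
\begin{document}
% If the current font has no glyph for U+2603, the next line
% raises an error rather than producing a gap in the output.
Snowman: ☃
\end{document}
```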

LaTeX is fairly similar to C here, because in both cases, the errors are nearly always important, a good document/program should compile with zero errors, yet thousands of old documents/programs do compile with errors and their users complain if you make them fatal by default.

[0]: https://tex.stackexchange.com/a/707031/270600

emil-lp•2 days ago

Using LaTeX makes sense because that's what all journals and conferences expect.

I'm writing two books, both in LaTeX.

I really don't get what the problem is.

mr_mitm•2 days ago

Using LaTeX is mostly fine, except for the endless compile times, useless error messages, lack of unicode, etc. like the GP said.

I'm maintaining an internally used LaTeX document class and the development experience is even worse. TeX has no concept of such avant-garde ideas like lists, dictionaries, or namespaces. Things break all the time, and sometimes only when you load three specific packages in a specific order because they all patch each other's routines. I still haven't completely grokked the idea of fragile commands and expanding macros. Characters can change meaning depending on context, even the `comment` character (%) or the `escape` character (\), (and I believe even the curly braces) for example when used inside `\path{}` or `\url{}` [1]. It makes a difference whether you comment out line endings or not. The LaTeX3 syntax looks like a bad joke. I mean, look at it:

  \ExplSyntaxOn
  \tl_set:Nn \l_tmpa_tl {A}
  \group_begin:
  \tl_set:Nn \l_tmpa_tl {B}
  \par value~inside~group:~\tl_use:N \l_tmpa_tl
  \group_end:
  \par value~outside~group:~\tl_use:N \l_tmpa_tl

  \tl_set:Nn \l_tmpb_tl {A}
  \group_begin:
  \tl_gset:Nn \l_tmpb_tl {B}
  \par value~inside~group:~\tl_use:N \l_tmpb_tl
  \group_end:
  \par value~outside~group:~\tl_use:N \l_tmpb_tl
  \ExplSyntaxOff

????

Let's just let it retire and focus our efforts on Typst and pushing publishers to accept Typst.

[1] Just look at all these poor souls trying to achieve something as exotic as putting a URL with a percent sign inside a footnote: https://tex.stackexchange.com/questions/12230/getting-percen...

gucci-on-fleek•2 days ago

> except for the endless compile times, useless error messages, lack of unicode, etc.

Some of these have been fixed; see my sibling comment [0] for more details.

> TeX has no concept of such avant-garde ideas like lists, dictionaries, or namespaces. […]. The LaTeX3 syntax looks like a bad joke.

But that is in fact the entire purpose of LaTeX3. I agree that the syntax looks intimidating, but it's actually quite nice once you learn it, and it's written that way to provide namespacing in TeX. Similarly, LaTeX3 defines lists, dictionaries, and most other conventional datastructures.
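
For a flavor of what that looks like, here is a small sketch using a sequence (list) and a property list (dictionary). The `myproj` prefix, the variable names, and the two user commands are all hypothetical names chosen for illustration; the prefix is what provides the namespacing by convention.

```latex
\documentclass{article}
\ExplSyntaxOn
% "myproj" is a hypothetical module prefix used as a namespace.
\seq_new:N  \l_myproj_items_seq    % a list
\prop_new:N \l_myproj_years_prop   % a dictionary (key/value store)
\seq_set_from_clist:Nn \l_myproj_items_seq { apple, pear, plum }
\prop_put:Nnn \l_myproj_years_prop { knuth } { 1938 }
% Document-level commands wrapping the data structures.
\NewDocumentCommand \ShowItems { }
  { \seq_use:Nn \l_myproj_items_seq { ,~ } }
\NewDocumentCommand \BirthYear { m }
  { \prop_item:Nn \l_myproj_years_prop { #1 } }
\ExplSyntaxOff
\begin{document}
\ShowItems\par
Knuth was born in \BirthYear{knuth}.
\end{document}
```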

> Things break all the time, and sometimes only when you load three specific packages in a specific order because they all patch each other's routines.

Hmm, well it depends. The LaTeX kernel and the TeX engines are more stable than nearly all other software, but the third-party packages do indeed break occasionally. But you see similar dynamics play out in most other ecosystems: JavaScript the language is incredibly stable and has excellent backwards compatibility, but if you use 50+ third-party packages, then things do indeed break occasionally.

> Characters can change meaning depending on context

Much like operator overloading in other languages, catcode changes in TeX can indeed be misused and are sometimes confusing, but they're also a pretty useful solution to problems that would otherwise be tricky to solve.
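
A minimal sketch of such a catcode change follows. Inside the group, the percent sign is reassigned to category 12 ("other") and is typeset as a literal character; the comments sit on their own lines on purpose, because a trailing comment after the `\catcode` assignment would no longer be read as a comment.

```latex
\documentclass{article}
\begin{document}
% Inside the group, the percent sign becomes an ordinary character.
\begingroup
\catcode`\%=12
50% of readers are surprised by this line.
\endgroup
% Outside the group, the percent sign starts a comment again.
\end{document}
```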

All this isn't to say that TeX doesn't have issues (I criticize LaTeX myself fairly frequently), but most of these are due to the fact that LaTeX is 40-year-old software built on a 50-year-old engine, and has remained backwards-compatible with documents throughout that entire time. And (La)TeX is slowly modernizing, so I'm fairly hopeful that things will continue to improve.

[0]: https://news.ycombinator.com/item?id=48515090

wolfi1•2 days ago

If I were to write a new TeX system, I would build on the attempts from the early 2000s, which tried to use Java and modularize the system: NTS [0] and ExTeX [1].

[0]: https://github.com/jamespfennell/new-typesetting-system

[1]: https://github.com/tex-other/extex

vincent-manis•1 day ago

Tectonic would be more convincing if the authors wrote their documentation in LaTeX, rather than Markdown (and I do know about Pandoc).

RestartKernel•2 days ago

A tool like this is sorely needed for LaTeX, and Tectonic is especially intuitive to embed into other applications, but the divergence of XeTeX from pdflatex makes it incompatible with most Overleaf projects. This is just an unfortunate ecosystem gripe, but for most workflows I end up having to reach for latexmk instead for this reason.

dhosek•2 days ago

Huh? Overleaf supports XeTeX out of the box. The LaTeX project, incidentally, has been moving towards requiring LuaTeX as the engine (also supported in Overleaf).

The pdftex engine is pretty much a dead end these days and I would only recommend its use for compiling legacy documents.

That said, the biggest problem has nothing to do with the source code of TeX. The change file mechanism is pretty straightforward and there have been tools for decades to allow application of more than one change file against the source, although with the standardization on web2c in the build process as well as better cross-platform C compilation in 2026 vs 1982, there isn’t the proliferation of platform-dependent change files that there was in the 80s when people were compiling on Pascal compilers that supported different subsets of the language.

But as I was saying before I got into that digression, the source language isn’t the issue with TeX so much as the basic architecture which is highly coupled to the limitations of computers in the late 70s/early 80s when even 7-bit ASCII couldn’t be assumed to be consistent between systems¹. As much as I enjoy writing TeX macros and can do wonderful things with them that most people would consider dark magic, it’s a cursed way to do programming and has no parallel in any other programming paradigm.

⸻

1. The SAIL platform at Stanford where Knuth did the initial work, for example, had ↑ in the code space ASCII designates as ^, and IBM mainframes all used EBCDIC which has the complication of having | and ¦ as two separate characters both of which were typically mapped to | in EBCDIC to ASCII conversions with the reverse conversions arbitrarily choosing one of the two characters so that there was no guarantee that you’d get the expected character in your text file conversion² or your ASCII terminal controller.

2. Which is yet another reason why non-Unix operating systems would have distinct text and binary modes for opening files.

gucci-on-fleek•2 days ago

> so much as the basic architecture which is highly coupled to the limitations of computers in the late 70s/early 80s when even 7-bit ASCII couldn’t be assumed to be consistent between systems¹. As much as I enjoy writing TeX macros and can do wonderful things with them that most people would consider dark magic

Well LuaTeX solves both of these problems, but I'm pretty sure that you're already aware of that :)

> it’s a cursed way to do programming and has no parallel in any other programming paradigm

It's fairly similar to C preprocessor macros, and writing Mathematica code occasionally reminds me of TeX, but these aren't exactly the most flattering comparisons.

RestartKernel•2 days ago

Thanks for the additional information, but with the "ecosystem gripe" I meant that most Overleaf projects are inadvertently designed for pdflatex simply because it's the default. No matter how much better other compilers are, pdflatex is the de facto standard in certain circles, even if few within these circles are aware of this (e.g. university laboratories), so I've had to begrudgingly switch back to latexmk for most projects to accommodate this.

Igor_Wiwi•2 days ago

In my project https://mdview.io I support LaTeX, but sometimes people share weird syntax that looks like LaTeX but is not rendered correctly. I haven't found a good way to fix it; I will probably use a hybrid approach like the one I adopted for broken Mermaid diagrams (LLM + heuristics).

certik•2 days ago

Just wanted to say big thanks to the maintainers, I've been using tectonic the last couple years as my only LaTeX distribution, works everywhere (including macOS), it's available in conda-forge, so I can just have it as a dependency in my projects. Everything "just works", that's the best way to describe it.

Thank you for fully fixing LaTeX for me.

Lucasoato•2 days ago

Maybe a little off topic, but Kudos to the people who chose the name!

It reminds me of that very embarrassing dance we used to do around 2008 :)

https://it.wikipedia.org/wiki/Tecktonik