@agucova
AI Safety Group Support Lead @ CEA (opinions my own)
https://agucova.dev
I'm a generalist, currently working in AI Safety fieldbuilding. I've also worked on software engineering, cost-effectiveness modeling, tech policy, and cybersecurity. See here for my background.
Agustín Covarrubias
10 months ago
@pravsels Instead of building a platform, you could build tooling that integrates into existing academic publishing workflows. For example, Quarto is already a relatively successful attempt to improve technical publishing workflows, and I imagine you could make tooling that plugs into it (either through an extension or by outputting Quarto). This would leverage the existing community around Quarto and let you focus on what matters.
Agustín Covarrubias
10 months ago
@agucova having said all this, I think this project looks promising! I would be really excited to see a growing open-source ecosystem around tools for low-effort, high-quality visualization for explanations and research papers, and there's probably a very large audience of researchers (or communicators) that could adopt these tools if they're framed in the right way.
Agustín Covarrubias
10 months ago
Are you also hoping to revive the journal model? Or are you planning to focus exclusively on supporting self-published articles?
If the former, I'm worried that this proposal does not address the points made in the hiatus article against this theory of change. If the latter, I'm worried that jumping straight into tool development might distract you from the broader picture: where are the specific bottlenecks? Which use cases do you want to support? How can you get researchers to adopt these workflows?
As for the technical aspects of the proposal:
Creating such visually rich articles can be accelerated by building AI tools such as:
- code completion foundation model (FM) for generating Math animations (using the Manim library created by 3Blue1Brown).
- image generation multi-modal FM for going from sketches to diagrams. editing of generated diagrams with user-created masks and text prompts.
Is it really necessary to finetune a new model? Couldn't you build on the existing Manim capabilities of SOTA LLMs (e.g. through RAG or a self-documenting system prompt)? In my brief experience trying to generate Manim code, LLMs seem to make very straightforward mistakes that could be solved by just giving them better context on the library.
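To make the "self-documenting system prompt" idea a bit more concrete, here is a minimal sketch of what I have in mind. It only uses Python's standard `inspect` module to pull signatures and docstring summaries out of a library and paste them into a prompt. The function names (`build_library_context`, `build_system_prompt`) and the prompt wording are made up for illustration, and I'm using the stdlib `json` module as a stand-in so the snippet runs anywhere; in practice you would presumably pass the Manim module and a list of the classes you care about.

```python
import inspect
import json  # stand-in library so the sketch runs without extra installs


def build_library_context(module, names, max_doc_chars=300):
    """Collect signatures and docstring summaries for selected names in a module."""
    entries = []
    for name in names:
        obj = getattr(module, name, None)
        if obj is None:
            continue  # skip names this version of the library does not expose
        try:
            signature = str(inspect.signature(obj))
        except (TypeError, ValueError):
            signature = "(...)"  # some callables have no introspectable signature
        doc = inspect.getdoc(obj) or "No docstring available."
        # Keep only the first paragraph of the docstring, truncated.
        summary = doc.strip().split("\n\n")[0][:max_doc_chars]
        entries.append(f"{module.__name__}.{name}{signature}\n    {summary}")
    return "\n\n".join(entries)


def build_system_prompt(module, names):
    """Assemble a (hypothetical) system prompt that embeds the library context."""
    context = build_library_context(module, names)
    return (
        f"You write code using the `{module.__name__}` library.\n"
        "Only use the API listed below; do not invent arguments.\n\n"
        f"{context}"
    )


# Names that do not exist in the module are silently dropped.
prompt = build_system_prompt(json, ["dumps", "loads", "not_a_real_name"])
print(prompt)
```

Because the context is generated from whatever version of the library is installed, it stays in sync with the API without any retraining. A RAG setup would be the same idea, except that only the entries relevant to the current request get retrieved.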
As for image generation: are you hoping to get the diagram images as output? Or is your idea to use the model as part of a pipeline that goes from a sketch to some kind of renderable spec (e.g. Mermaid)? Naively, I don't see how either approach could be reliable with current models, but I might be missing something.
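For what it's worth, here is a rough sketch of why the "renderable spec" route seems easier to make reliable: if the model emits a small structured description (nodes and edges), it can be validated before anything is rendered. The `to_mermaid` function and the dict/tuple input format are hypothetical, and the checks are deliberately simple; they don't cover every Mermaid edge case (reserved words, for instance).

```python
def to_mermaid(nodes, edges):
    """Render a validated node/edge spec as Mermaid flowchart text.

    nodes: dict mapping node id -> human-readable label
    edges: list of (source_id, target_id) tuples
    """
    # Reject ids that would not be plain identifiers in the output.
    for node_id in nodes:
        if not node_id.isidentifier():
            raise ValueError(f"invalid node id: {node_id!r}")
    # Reject edges that point at nodes the spec never declared.
    for source, target in edges:
        for endpoint in (source, target):
            if endpoint not in nodes:
                raise ValueError(f"edge references unknown node: {endpoint!r}")

    lines = ["flowchart TD"]
    for node_id, label in nodes.items():
        safe_label = label.replace('"', "'")  # avoid breaking the quoted label
        lines.append(f'    {node_id}["{safe_label}"]')
    for source, target in edges:
        lines.append(f"    {source} --> {target}")
    return "\n".join(lines)


# Example input, as a model might produce it from a sketch (made-up data).
example_nodes = {
    "sketch": "Hand-drawn sketch",
    "model": "Multi-modal model",
    "spec": "Mermaid spec",
}
example_edges = [("sketch", "model"), ("model", "spec")]
print(to_mermaid(example_nodes, example_edges))
```

A malformed spec fails loudly with a `ValueError` instead of producing a subtly wrong image, which is the kind of check that isn't available when the model outputs pixels directly.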