I asked galactica to write a blog post and the results weren't great
A few weeks ago, Meta AI announced Galactica1, a large language model (LLM) built for scientific workflows. The architecture for Galactica is a fairly vanilla transformer model, but with three interesting modifications to the training process.
First, the training corpus itself is comprised of scientific documents. These are mostly …