Photo Credit: Unsplash/Patrick Tomasso
With so much research published every day around the world, a powerful search engine has become essential for sifting through the seemingly endless flood of academic papers. Faced with that challenge, one technologist has found a way to unlock the world's research papers for easier computerised analysis. He has released an online index of some 107.2 million journal articles, including many paywalled research papers, totalling 38TB of data in uncompressed form.
The General Index, created by American archivist Carl Malamud, was released on October 7 and is free to use. The index holds more than 355 billion sentence fragments and words, each listed alongside the articles in which it appears. “It is an effort to help scientists use software to glean insights from published work even if they have no legal access to the underlying papers,” Malamud told the journal Nature.
The index's primary objective is to aid text mining: using computers to rapidly scan millions of data points for references to something specific. No human could read millions of journal articles, but a computer programme connected to the General Index can.
Researchers who had early access to the index have called it a big development. Gitanjali Yadav, a computational biologist at the University of Cambridge, UK, who studies volatile organic compounds emitted by plants, said the index will help researchers tap into papers that existed but were effectively out of reach. Until now, researchers could mine only open-access papers or those covered by their subscriptions; the index removes that barrier.
Malamud said his index contains only snippets up to five words long, so releasing it does not breach publishers' copyright restrictions.
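The core idea of such an index can be sketched as a mapping from short word n-grams (capped here at five words, matching the snippet limit Malamud describes) to the articles that contain them. This is only an illustrative toy, not the General Index's actual format or tooling; the corpus, IDs, and function names below are invented for the example.

```python
# Toy sketch of an n-gram index: maps each word n-gram (1-5 words)
# to the set of article IDs in which it appears. Illustrative only --
# not the General Index's real data format.
from collections import defaultdict

def ngrams(text, max_n=5):
    """Yield every word n-gram of length 1..max_n from the text."""
    words = text.lower().split()
    for n in range(1, max_n + 1):
        for i in range(len(words) - n + 1):
            yield " ".join(words[i:i + n])

def build_index(articles):
    """Build a mapping from n-gram -> set of article IDs containing it."""
    index = defaultdict(set)
    for article_id, text in articles.items():
        for gram in ngrams(text):
            index[gram].add(article_id)
    return index

# Hypothetical miniature corpus standing in for millions of papers.
articles = {
    "paper-1": "Volatile organic compounds emitted by plants",
    "paper-2": "Organic compounds in soil samples",
}
index = build_index(articles)
print(sorted(index["organic compounds"]))  # → ['paper-1', 'paper-2']
```

A query for a phrase then becomes a single dictionary lookup, which is what lets software locate every article mentioning a term without anyone reading the papers themselves.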