Alchemy RecipeIntermediatestack

Academic paper digesting pipeline for research synthesis

Published

If you're a researcher, academic, or knowledge worker drowning in PDFs, you know the problem: reading papers sequentially takes weeks, but you need synthesis across dozens of them by Friday. Most people copy paragraphs into spreadsheets, manually summarise findings, then paste everything into a single document. It's tedious, error-prone, and defeats the purpose of having AI tools in the first place....... For more on this, see Academic research synthesis and citation-ready literature.... For more on this, see Academic literature review synthesis from research papers. For more on this, see Academic paper research and literature review synthesis.

What if your papers could analyse themselves? This workflow automates the entire pipeline: PDFs arrive in your inbox, get summarised by multiple AI tools in parallel, then compile into a structured synthesis document, all without you touching anything. You'll go from "I have 50 papers to read" to "here's the key findings across all 50" in minutes, not days.

This guide walks you through building an academic paper digesting pipeline using three specialised AI tools and one orchestration platform. By the end, you'll have a fully automated system that extracts insights, simplifies dense academic language, and generates comparison matrices across multiple papers with zero manual handoff.

The Automated Workflow

Architecture Overview

The workflow follows this sequence: incoming PDF files trigger an orchestration service, which distributes the PDF to three parallel summarisation tools, collects their outputs, then compiles them into a single markdown document that gets saved to cloud storage or emailed to you. The entire process takes 2-5 minutes per paper, depending on length and complexity.

We'll use n8n as the primary orchestration tool because it handles file downloads, parallel execution, and conditional logic without requiring code. If you prefer visual workflows with more integrations, Make (Integromat) works equally well. Zapier is simpler but less flexible for this use case.

Step 1: Setting Up the Trigger

Start by deciding how papers arrive. Common options are an email inbox with attachments, a shared Google Drive folder, or a webhook you control. For this example, we'll use a Google Drive folder as the trigger.

In n8n, add a "Google Drive" node set to watch for new files in a specific folder. Configure it like this:


Trigger: Google Drive > Watch Folder
Folder: /Research Papers
File Filter: .pdf
Polling Interval: 5 minutes

When a new PDF appears, n8n captures the file ID and creates a trigger event. From there, the workflow splits into three parallel branches, one for each summarisation tool.

Step 2: Parallel PDF Processing with Chat-with-PDF-by-Copilotus

This tool excels at extracting specific sections from papers. Create an n8n HTTP Request node that uploads the PDF and prompts the tool with a structured query.

First, you'll need the Copilotus API key. Register at copilotus.ai, generate an API token, and store it securely in n8n's credential manager.


POST https://api.copilotus.ai/v1/chat
Headers:
  Authorization: Bearer YOUR_API_KEY
  Content-Type: application/json

Body:
{
  "file_url": "{{ $node['Google Drive'].data.webViewLink }}",
  "message": "Summarise this paper in 150 words. Include: 1) Research question, 2) Methodology, 3) Key findings, 4) Limitations.",
  "response_format": "text"
}

Important: the file_url must be a direct download link, not a view link. Modify the Google Drive link from /view?usp=sharing to /export?format=pdf. n8n's Google Drive node usually handles this automatically, but verify the link works by pasting it into your browser.

The Copilotus tool returns a structured summary within 30-60 seconds. Store this response in a variable for later use.

Step 3: Parallel Processing with ExplainPaper

ExplainPaper specialises in breaking down dense academic language. It's particularly useful for papers with heavy mathematical notation or unfamiliar terminology.

Create a second HTTP Request node pointing to ExplainPaper's API:


POST https://api.explainpaper.com/v1/upload
Headers:
  Authorization: Bearer YOUR_EXPLAINPAPER_API_KEY
  Content-Type: application/json

Body:
{
  "file_url": "{{ $node['Google Drive'].data.fileLink }}",
  "analysis_type": "full",
  "focus_areas": ["abstract", "methodology", "results"]
}

ExplainPaper generates a simplified explanation of the paper's core concepts. This typically takes 45-90 seconds. The output includes highlighted sections, glossary definitions, and a plain-English summary suitable for readers without domain expertise.

Step 4: Parallel Processing with Resoomer

Resoomer is efficient at structural summarisation. It identifies the paper's main arguments and arranges them hierarchically.


POST https://api.resoomer.com/v1/summarize
Headers:
  Authorization: Bearer YOUR_RESOOMER_API_KEY
  Content-Type: application/json

Body:
{
  "url": "{{ $node['Google Drive'].data.fileLink }}",
  "language": "en",
  "percentage": 30,
  "type": "key-points"
}

The "percentage" parameter controls summary length; 30% usually yields 200-400 words per paper. Resoomer completes in 20-40 seconds.

Step 5: Aggregating Results

Once all three tools finish, you'll have three separate outputs. In n8n, use a "Wait" node to ensure all parallel branches complete before proceeding:


Wait for All Branches
Timeout: 300 seconds (5 minutes)
Execution Type: All Branches

Next, add a "Function" node to combine the three outputs into a single structured document:

const copilotus_summary = $node['Copilotus Request'].data.message;
const explainpaper_analysis = $node['ExplainPaper Request'].data.analysis;
const resoomer_summary = $node['Resoomer Request'].data.summary;

const combined = `
## Summary from Copilotus
${copilotus_summary}

## Simplified Explanation (ExplainPaper)
${explainpaper_analysis}

## Key Points (Resoomer)
${resoomer_summary}

Generated: ${new Date().toISOString()}
`;

return { document: combined };

Step 6: Storage and Output

Save the combined analysis to Google Drive or email it to yourself. For email, use n8n's Gmail node:


Gmail: Send Email
To: your-email@example.com
Subject: Paper Analysis - {{ $node['Google Drive'].data.name }}
Body: {{ $node['Function'].data.document }}

Alternatively, save to a Google Docs document or Notion database for searchability across multiple papers. If you're building a synthesis document that accumulates papers, use a "Google Sheets" node to log metadata (paper title, date analysed, tool confidence scores) in a comparison table.

Complete n8n Workflow JSON

Here's a simplified structure you can import directly into n8n:

{
  "nodes": [
    {
      "name": "Watch Google Drive Folder",
      "type": "n8n-nodes-base.googleDrive",
      "operation": "watch",
      "folderPath": "/Research Papers"
    },
    {
      "name": "Copilotus Request",
      "type": "n8n-nodes-base.httpRequest",
      "method": "POST",
      "url": "https://api.copilotus.ai/v1/chat",
      "runAfter": ["Watch Google Drive Folder"]
    },
    {
      "name": "ExplainPaper Request",
      "type": "n8n-nodes-base.httpRequest",
      "method": "POST",
      "url": "https://api.explainpaper.com/v1/upload",
      "runAfter": ["Watch Google Drive Folder"]
    },
    {
      "name": "Resoomer Request",
      "type": "n8n-nodes-base.httpRequest",
      "method": "POST",
      "url": "https://api.resoomer.com/v1/summarize",
      "runAfter": ["Watch Google Drive Folder"]
    },
    {
      "name": "Wait for Completion",
      "type": "n8n-nodes-base.wait",
      "waitType": "execOnce",
      "runAfter": ["Copilotus Request", "ExplainPaper Request", "Resoomer Request"]
    },
    {
      "name": "Aggregate Results",
      "type": "n8n-nodes-base.function",
      "runAfter": ["Wait for Completion"]
    },
    {
      "name": "Send Email",
      "type": "n8n-nodes-base.gmail",
      "operation": "sendEmail",
      "runAfter": ["Aggregate Results"]
    }
  ]
}

The Manual Alternative

If you want more control over which tools analyse each paper, or you need to pick specific sections for synthesis rather than full summaries, skip automation and use the tools directly.

Open Chat-with-PDF in your browser, upload a paper, and ask targeted questions: "What is the statistical significance of the main result?" or "List all limitations mentioned by the authors." ExplainPaper lets you click on sentences to get instant definitions and contextual explanations. Resoomer gives you a summarisation slider so you can adjust length on the fly.

This approach is slower, typically 10-15 minutes per paper, but it's more interactive and suitable when you need to verify claims or dig into methodology. Use manual processing for papers that are especially relevant to your research, and reserve automation for initial screening across a larger batch.

For synthesis across papers, create a spreadsheet with columns for author, year, key findings, and methodology. Manually fill it in from your individual paper analyses. This takes longer but creates a reusable comparison matrix you can sort and filter.

Pro Tips

Handling Rate Limits and Timeouts

All three API tools have rate limits. Copilotus allows 10 requests per minute on the free plan; ExplainPaper and Resoomer are more generous at 50 per minute. If you're processing many papers at once, add a delay between requests in n8n using the "Wait" node set to 6 seconds. This prevents hitting rate limits:


Wait Node
Time: 6 seconds
Between Iterations

If a request fails, add an error handler that retries up to three times with exponential backoff. In n8n, right-click a node and select "Add Error Handler", then chain another "HTTP Request" with the same parameters.

Improving Summary Quality

The three tools produce different output styles. Copilotus is structured and factual; ExplainPaper is accessible and beginner-friendly; Resoomer is concise and hierarchical. If you're working in a specific discipline, customise the prompts sent to each tool. For example, in medical research, ask Copilotus to highlight sample sizes and p-values. In computer science, ask it to identify algorithms and datasets.

Cost Optimisation

Don't run all three tools on every paper if you don't need to. Create a "triage" step that uses only Resoomer (the fastest and cheapest) first, then routes high-priority papers to ExplainPaper and Copilotus for deeper analysis. In n8n, add a conditional branch after Resoomer that checks the summary length or keywords to decide whether to proceed:

const summary = $node['Resoomer Request'].data.summary;
const shouldDeepen = summary.includes('novel') || summary.length < 150;
return { shouldDeepen };

Combining Results into a Synthesis Document

If you're accumulating analyses across multiple papers to build a literature review, use Notion or Google Docs as your synthesis hub. Create a database in Notion with properties for paper title, authors, year, summary, and key quotes. After all three tools complete, use n8n's Notion node to append a new entry:


Notion: Create Database Item
Database: Research Synthesis
Properties:
  Title: {{ $node['Google Drive'].data.name }}
  Authors: {{ $node['Copilotus Request'].data.authors }}
  Copilotus Summary: {{ $node['Copilotus Request'].data.message }}
  ExplainPaper Notes: {{ $node['ExplainPaper Request'].data.analysis }}
  Resoomer Points: {{ $node['Resoomer Request'].data.summary }}

Monitoring Workflow Failures

PDF processing sometimes fails if a paper is scanned (image-based) rather than text-based, or if it has unusual encoding. Set up error notifications so you know which papers didn't process:


Catch Error Handler
Send Slack Message: "Failed to analyse: {{ $node['Google Drive'].data.name }}"

This way you can manually review those papers or re-upload them in a different format.

Cost Breakdown

ToolPlan NeededMonthly CostNotes
Chat-with-PDF-by-CopilotusPro£15100 requests/month; overage £0.10 per request
ExplainPaperStandard£20Unlimited uploads; includes simplified explanations
ResoomerPlus£850 documents/month; very fast summarisation
n8n (self-hosted)Community (free)£0Run on your own server; requires basic DevOps knowledge
n8n (cloud)Starter£10Easiest setup; 1,000 executions/month included
Google Drive storage100GB plan£1.99If not already subscribed
Total (self-hosted)£43/monthBest for privacy and cost at scale
Total (n8n cloud)£54/monthBest for ease of use

The table assumes you're processing 10-15 papers weekly. If you process fewer, drop to Copilotus Free (£0, 10/month) and combine it with Resoomer Plus only. If you process 50+ papers weekly, self-hosting n8n saves money and eliminates API call limits.

More Recipes