Confluence: Importing Wiki Pages for Analysis

Updated February 25, 2026

Import Confluence wiki pages into clariBI to analyze team documentation, meeting notes, and project data. This guide covers OAuth setup, page selection, sync configuration, and tips for getting useful insights from wiki content.

Overview

Confluence is where many teams keep their documentation, meeting notes, project plans, and knowledge bases. By connecting Confluence to clariBI, you can import this content and use AI analytics to extract insights, track project status, and identify patterns across your documentation.

This guide walks through connecting Confluence, selecting content to import, and getting started with analysis.

Prerequisites

  • An active Atlassian Confluence account (Cloud or Data Center)
  • Admin access to the Confluence space(s) you want to import
  • Analyst role or above in your clariBI organization

Step 1: Connect Confluence

For Confluence Cloud

  1. Go to Data Sources in the clariBI sidebar.
  2. Click Add Source.
  3. Select Confluence as the data source type.
  4. Click Connect with Atlassian.
  5. Log in to your Atlassian account if prompted.
  6. Review the permissions clariBI is requesting:
       • Read Confluence content -- Access to pages, blog posts, and attachments
       • Read Confluence spaces -- Access to space metadata and structure
  7. Click Accept.
  8. After authorization, clariBI shows your Confluence site URL. Confirm it is correct.

For Confluence Data Center (Self-Hosted)

  1. Follow steps 1-3 for Confluence Cloud, but select Confluence Data Center as the data source type in step 3.
  2. Enter your Confluence server URL (e.g., https://confluence.yourcompany.com).
  3. Provide an API token generated from your Confluence admin settings.
  4. Click Test Connection, then Save.

Step 2: Select Spaces and Pages

After connecting:

  1. clariBI shows a list of all Confluence spaces you have access to.
  2. Select the spaces you want to import. You can choose:
       • Entire space -- All pages and sub-pages in the space
       • Specific pages -- Browse the page tree and select individual pages
       • Pages with a specific label -- Import only pages tagged with certain Confluence labels
  3. Choose the content types to include:
       • Pages -- Standard wiki pages (recommended)
       • Blog posts -- Team blog entries
       • Attachments -- Files attached to pages (PDFs, spreadsheets, images); only file metadata is imported, not file contents
  4. Click Continue.

Filtering by Label

Label-based filtering is useful when you only want to analyze a subset of content. For example, if your team tags meeting notes with a "meeting-notes" label, you can import only those pages.
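
To preview which pages a label filter would match before importing, one option is to run the equivalent search in Confluence itself. The sketch below only builds the search URL and sends nothing. It assumes Confluence Cloud's content search endpoint (`/wiki/rest/api/content/search`) and CQL; the site URL, the `ENG` space key, and the helper name `build_label_search_url` are illustrative. It does not describe how clariBI queries Confluence internally, and Data Center paths may differ.

```python
from typing import Optional
from urllib.parse import urlencode


def build_label_search_url(site_url: str, label: str, space_key: Optional[str] = None) -> str:
    """Build a Confluence Cloud CQL search URL for pages carrying a given label."""
    clauses = ["type = page", f'label = "{label}"']
    if space_key:
        # Optionally narrow the search to one space (key is an assumed example).
        clauses.append(f'space = "{space_key}"')
    cql = " AND ".join(clauses)
    query = urlencode({"cql": cql, "limit": 25})
    return f"{site_url.rstrip('/')}/wiki/rest/api/content/search?{query}"


# Hypothetical site and space key, for illustration only.
url = build_label_search_url("https://yourcompany.atlassian.net", "meeting-notes", "ENG")
print(url)
```

Opening the resulting URL while logged in to Confluence should list the matching pages, which is a quick check that the label is applied consistently before you rely on it as an import filter.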

Step 3: Configure Sync Settings

  1. Sync frequency: Daily (recommended), weekly, or manual only.
  2. Historical depth: Import all pages or only pages created/modified in the last 30, 90, or 180 days.
  3. Content format: clariBI imports the text content of each page. Tables, headings, and lists are preserved. Images are referenced but not analyzed.
  4. Click Save and Sync.

The initial sync imports all selected content that falls within the historical depth you chose. Subsequent syncs pull only new and updated pages.
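
The interaction between historical depth and incremental syncs can be sketched as follows. This is a simplified model under stated assumptions, not clariBI's actual implementation: `PageStub` and `pages_to_sync` are hypothetical names, and each page is assumed to expose a single last-modified timestamp (a newly created page counts as modified at creation time).

```python
from dataclasses import dataclass
from datetime import datetime, timedelta, timezone
from typing import List, Optional


@dataclass
class PageStub:
    page_id: str
    last_modified: datetime


def pages_to_sync(
    pages: List[PageStub],
    now: datetime,
    last_sync: Optional[datetime],
    history_days: Optional[int],
) -> List[PageStub]:
    """Pick the pages to pull on this run.

    First run (last_sync is None): apply the historical-depth cutoff, if any.
    Later runs: only pages modified after the previous sync.
    """
    if last_sync is None:
        if history_days is None:  # "import all pages"
            return list(pages)
        cutoff = now - timedelta(days=history_days)
        return [p for p in pages if p.last_modified >= cutoff]
    return [p for p in pages if p.last_modified > last_sync]


now = datetime(2026, 2, 25, tzinfo=timezone.utc)
pages = [
    PageStub("a", now - timedelta(days=200)),
    PageStub("b", now - timedelta(days=45)),
    PageStub("c", now - timedelta(hours=3)),
]
first_run = pages_to_sync(pages, now, last_sync=None, history_days=90)
next_run = pages_to_sync(pages, now, last_sync=now - timedelta(days=1), history_days=90)
print([p.page_id for p in first_run], [p.page_id for p in next_run])
```

In this model, a page older than the chosen depth is skipped on the first run and is only picked up later if someone edits it.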

Using Confluence Data in clariBI

AI-Powered Analysis

With Confluence data imported, you can ask questions like:

  • "Summarize the key decisions from all meeting notes in the Engineering space this quarter"
  • "How many project status pages mention 'delayed' or 'at risk'?"
  • "What topics appear most frequently across our product documentation?"
  • "List all action items from the last 10 meeting notes"

Each query costs 1 AI credit.
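
For a question such as the "delayed" or "at risk" count, a plain keyword match over exported page text gives a rough cross-check of the AI's answer. The sketch below is only that: a whole-word, case-insensitive match over sample data. The function name and the sample pages are made up for illustration, and this is not how clariBI's AI analysis works.

```python
import re
from typing import Dict, Iterable


def count_pages_mentioning(pages: Dict[str, str], terms: Iterable[str]) -> int:
    """Count pages whose body contains at least one term (whole-word, case-insensitive)."""
    pattern = re.compile(
        "|".join(r"\b" + re.escape(term) + r"\b" for term in terms),
        re.IGNORECASE,
    )
    return sum(1 for body in pages.values() if pattern.search(body))


# Sample page titles and bodies, invented for this example.
status_pages = {
    "Project Alpha status": "Milestone 2 is delayed by one sprint.",
    "Project Beta status": "On track for the March release.",
    "Project Gamma status": "Integration work is AT RISK pending vendor delivery.",
}
flagged = count_pages_mentioning(status_pages, ["delayed", "at risk"])
print(flagged)
```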

Building Dashboards

Create a dashboard that tracks Confluence content:

  • Content volume widget -- Number of pages created per week/month
  • Top contributors -- Who is writing the most documentation
  • Label distribution -- Which labels are used most frequently
  • Recent updates -- A feed of recently modified pages
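
The aggregations behind the first three widgets are simple counts. As a minimal sketch, assuming each imported page carries an author, a creation date, and a list of labels (the record shape and sample values here are hypothetical):

```python
from collections import Counter
from datetime import date

# Sample page records, invented for this example.
pages = [
    {"author": "dana", "created": date(2026, 2, 2), "labels": ["meeting-notes"]},
    {"author": "lee", "created": date(2026, 2, 4), "labels": ["meeting-notes", "engineering"]},
    {"author": "dana", "created": date(2026, 2, 10), "labels": ["project-status"]},
]


def iso_week(d: date) -> str:
    """Format a date as an ISO year-week key such as '2026-W06'."""
    year, week, _ = d.isocalendar()
    return f"{year}-W{week:02d}"


pages_per_week = Counter(iso_week(p["created"]) for p in pages)          # content volume
top_contributors = Counter(p["author"] for p in pages)                   # top contributors
label_counts = Counter(label for p in pages for label in p["labels"])    # label distribution

print(dict(pages_per_week))
print(top_contributors.most_common(1))
print(label_counts.most_common(1))
```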

Generating Reports

Generate AI-powered reports from your Confluence data:

  • Quarterly documentation review -- Summarizes new and updated pages
  • Meeting notes digest -- Extracts decisions and action items across multiple meetings
  • Knowledge gap analysis -- Identifies topics with sparse documentation

Content Processing

What clariBI Imports

  • Page title and hierarchy (parent/child relationships)
  • Page body as plain text (HTML formatting is stripped; structure such as headings, lists, and tables is preserved)
  • Author and last modified date
  • Labels/tags assigned to the page
  • Attachment metadata (file name, type, size) -- not the file contents
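
One way to picture an imported page is as a record with the fields listed above. The sketch below is an illustrative model only; the class and field names (`ImportedPage`, `AttachmentMeta`, `parent_title`, and so on) are assumptions, not clariBI's actual schema.

```python
from dataclasses import dataclass, field
from datetime import datetime
from typing import List, Optional


@dataclass
class AttachmentMeta:
    # Metadata only; the file contents are not imported.
    file_name: str
    file_type: str
    size_bytes: int


@dataclass
class ImportedPage:
    title: str
    parent_title: Optional[str]  # None for a top-level page
    body_text: str               # plain text with HTML formatting stripped
    author: str
    last_modified: datetime
    labels: List[str] = field(default_factory=list)
    attachments: List[AttachmentMeta] = field(default_factory=list)


# Sample values, invented for this example.
page = ImportedPage(
    title="Sprint 12 Retro",
    parent_title="Meeting Notes",
    body_text="Decisions\n- Move release to March\n",
    author="dana",
    last_modified=datetime(2026, 2, 20, 15, 30),
    labels=["meeting-notes"],
    attachments=[AttachmentMeta("retro-board.pdf", "application/pdf", 182_044)],
)
print(page.title, page.labels, page.attachments[0].file_name)
```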

What clariBI Does Not Import

  • Page permissions (all imported content follows clariBI's access controls)
  • Confluence macros (rendered output is imported as text where possible)
  • Inline comments on pages
  • Page version history (only the current version is imported)

Managing the Connection

Re-syncing

To trigger a manual sync:

  1. Go to Data Sources and click the Confluence connection.
  2. Click Sync Now.
  3. clariBI pulls any new or updated pages since the last sync.

Handling Deleted Pages

If a Confluence page is deleted after import, clariBI marks it as "Source Deleted" in the data. The content remains in clariBI until you manually remove it or run a cleanup.
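
The behavior described above amounts to a soft delete followed by an optional cleanup. A minimal sketch, assuming imported pages are keyed by page ID and that a sync knows the set of IDs still present in Confluence (the function names and record shape are hypothetical):

```python
from typing import Dict, List, Set

SOURCE_DELETED = "Source Deleted"


def mark_source_deleted(imported: Dict[str, dict], live_ids: Set[str]) -> List[str]:
    """Flag imported pages that no longer exist in Confluence; their content is kept."""
    newly_flagged = []
    for page_id, record in imported.items():
        if page_id not in live_ids and record.get("status") != SOURCE_DELETED:
            record["status"] = SOURCE_DELETED
            newly_flagged.append(page_id)
    return newly_flagged


def cleanup(imported: Dict[str, dict]) -> Dict[str, dict]:
    """Return a copy of the store without pages flagged as Source Deleted."""
    return {pid: rec for pid, rec in imported.items() if rec.get("status") != SOURCE_DELETED}


# Sample store and IDs, invented for this example.
store = {
    "101": {"title": "Roadmap", "status": "Active"},
    "102": {"title": "Old runbook", "status": "Active"},
}
flagged = mark_source_deleted(store, live_ids={"101"})
after_cleanup = cleanup(store)
print(flagged, sorted(after_cleanup))
```

Until the cleanup runs, the flagged page is still present in the store, so existing analyses that reference it keep working.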

Disconnecting

  1. Go to Data Sources.
  2. Click the three-dot menu next to the Confluence connection.
  3. Select Disconnect.
  4. Previously imported data remains in clariBI but no new syncs occur.

To remove the imported data as well, select Disconnect and Remove Data instead of Disconnect in step 3.

Troubleshooting

"Insufficient Permissions" Error

  • Ensure your Confluence account has at least read access to the spaces you selected.
  • For Data Center, verify the API token has the correct permissions.

Missing Pages After Sync

  • Check that the pages are in a selected space or match the label filter.
  • Pages in restricted spaces (personal spaces or spaces with restricted permissions) may not be accessible to the API.
  • Run a manual sync and check the sync log for errors.

Large Spaces Take Too Long

  • If a space has thousands of pages, the initial sync may take 10-30 minutes. Subsequent syncs are faster because only changes are pulled.
  • Consider filtering by label to import only relevant content.
