Quickstart for PDF Accessibility Auto-Tag API (Node.js)

To get started using Adobe PDF Accessibility Auto-Tag API, let's walk through a simple scenario: taking an input PDF document and running PDF Accessibility Auto-Tag API against it. Once the PDF has been tagged, the API returns the tagged document and, optionally, a report file. This guide walks you through the complete process of creating a program that accomplishes this task.

Prerequisites

To complete this guide, you will need:

Node.js - Node.js version 10.13.0 or higher is required.
An Adobe ID. If you do not have one, the credential setup will walk you through creating one.
A way to edit code. No specific editor is required for this guide.
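
If you are unsure which Node.js version is installed, a short script can compare it against the minimum listed above. This is a minimal sketch, not part of the SDK; the helper name meetsMinimum is an assumption made for illustration, and it only handles plain dotted version numbers.

```javascript
// Minimum version taken from the prerequisites above.
const MINIMUM_NODE_VERSION = '10.13.0';

// Compare two dotted version strings numerically. Returns true when
// `actual` is greater than or equal to `required`.
// (Hypothetical helper, written for this guide.)
function meetsMinimum(actual, required) {
    const a = actual.split('.').map(Number);
    const r = required.split('.').map(Number);
    for (let i = 0; i < r.length; i++) {
        const part = a[i] || 0;
        if (part > r[i]) return true;
        if (part < r[i]) return false;
    }
    return true;
}

// process.versions.node holds the running version without the leading "v".
if (meetsMinimum(process.versions.node, MINIMUM_NODE_VERSION)) {
    console.log(`Node.js ${process.versions.node} is supported.`);
} else {
    console.error(`Node.js ${MINIMUM_NODE_VERSION} or higher is required.`);
}
```

Save it under any file name and run it with node before continuing.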

Step One: Getting credentials

1) To begin, open your browser to https://acrobatservices.adobe.com/dc-integration-creation-app-cdn/main.html?api=pdf-accessibility-auto-tag-api. If you are not already logged in to Adobe.com, you will need to sign in or create a new user. Using a personal email account, not a federated ID, is recommended.

2) After registering or logging in, you will then be asked to name your new credentials. Use the name, "New Project".

3) Change the "Choose language" setting to "Node.js".

4) Also note the checkbox next to "Create personalized code sample." This will include a large set of samples along with your credentials. These can be helpful for learning more later.

5) Click the checkbox saying you agree to the developer terms and then click "Create credentials."

6) After your credentials are created, they are automatically downloaded.

Step Two: Setting up the project

1) In your Downloads folder, find the ZIP file with your credentials: PDFServicesSDK-Node.jsSamples.zip. If you unzip that archive, you will find a README file, your private key, and a folder of samples.

2) We need two things from this download: the private.key file from the top level of the archive and the pdfservices-api-credentials.json file found in the samples directory.

Note that the private key is also found in the samples directory, so feel free to copy both files from there.

3) Take these two files and place them in a new directory. Remember that these credential files are important and should be stored safely.
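
Before going further, it can help to confirm that both files actually landed in the new directory. The following is a minimal sketch using only Node's built-in modules; the function name findMissingCredentialFiles is hypothetical, and the file names are the two mentioned in step 2.

```javascript
const fs = require('fs');
const path = require('path');

// File names taken from step 2 above.
const REQUIRED_FILES = ['private.key', 'pdfservices-api-credentials.json'];

// Return the names of the required files that are missing from `dir`.
// (Hypothetical helper, written for this guide.)
function findMissingCredentialFiles(dir) {
    return REQUIRED_FILES.filter(name => !fs.existsSync(path.join(dir, name)));
}

// Check the directory the script is run from.
const missing = findMissingCredentialFiles(process.cwd());
if (missing.length > 0) {
    console.error('Missing credential files: ' + missing.join(', '));
} else {
    console.log('Both credential files are in place.');
}
```

Run the script from inside the project directory so that process.cwd() points at the folder holding the credentials.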

4) At the command line, change to the directory you created, and initialize a new Node.js project with npm init -y.

5) Install the Adobe PDF Services Node.js SDK by typing npm install --save @adobe/pdfservices-node-sdk at the command line.

6) Install a package to help us work with ZIP files. Type npm install --save adm-zip.

At this point, we've installed the Node.js SDK for Adobe PDF Services API as a dependency for our project and have copied over our credentials files.

Our application will take a PDF, Adobe Accessibility Auto-Tag API Sample.pdf (downloadable from here), and tag its contents. The results will be saved in the directory output/AutotagPDF.
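
A quick way to confirm the sample PDF was downloaded into the project directory, and that it is really a PDF, is to check for the "%PDF-" signature that PDF files begin with. This is a sketch under the assumption that the file sits next to your script; looksLikePdf is a hypothetical helper, not an SDK function.

```javascript
const fs = require('fs');

// Return true when `filePath` is an existing file whose first five
// bytes are the "%PDF-" signature.
// (Hypothetical helper, written for this guide.)
function looksLikePdf(filePath) {
    if (!fs.existsSync(filePath) || !fs.statSync(filePath).isFile()) {
        return false;
    }
    const fd = fs.openSync(filePath, 'r');
    try {
        const header = Buffer.alloc(5);
        const bytesRead = fs.readSync(fd, header, 0, 5, 0);
        return bytesRead === 5 && header.toString('latin1') === '%PDF-';
    } finally {
        // Always release the file descriptor, even if the read fails.
        fs.closeSync(fd);
    }
}

const inputPdf = './Adobe Accessibility Auto-Tag API Sample.pdf';
console.log(looksLikePdf(inputPdf)
    ? 'Input PDF found.'
    : 'Input PDF is missing or is not a PDF.');
```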

7) In your editor, open the directory where you previously copied the credentials. Create a new file, autotag-pdf.js.

Now you're ready to begin coding.

Step Three: Creating the application

1) We'll begin by including our required dependencies:

const PDFServicesSdk = require('@adobe/pdfservices-node-sdk');
const fs = require('fs');

The first line includes the Adobe PDF Services Node.js SDK. The second includes Node's filesystem package, which the next step uses to clear any previous output. The code in this guide does not use the adm-zip package, because the results are saved directly as a PDF and an XLSX file.

2) Now let's define our input and output:

const INPUT_PDF = './Adobe Accessibility Auto-Tag API Sample.pdf';
const OUTPUT_PATH = './output/AutotagPDF/';

// Remove the output directory if it already exists.
// fs.rmSync requires Node.js 14.14.0 or later.
if (fs.existsSync(OUTPUT_PATH)) fs.rmSync(OUTPUT_PATH, { recursive: true });

const TAGGED_PDF = OUTPUT_PATH + INPUT_PDF + "-tagged-pdf.pdf";
const TAGGING_REPORT = OUTPUT_PATH + INPUT_PDF + "-tagging-report.xlsx";

This defines our output directory and deletes it if it already exists, then defines which PDF will be tagged and where the results will be written. The directory is removed with fs.rmSync, which requires Node.js 14.14.0 or later; on older versions, delete the directory by hand before running. (You can download the source we used here.) In a real application, these values would typically be dynamic.
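
Because the output names above are built by plain string concatenation, they include the input's leading "./" and its ".pdf" extension (for example, "Sample.pdf-tagged-pdf.pdf"). If you would prefer cleaner names, one option is to derive them with Node's path module. This is a sketch only; buildOutputPaths is a hypothetical helper and is not used elsewhere in this guide.

```javascript
const path = require('path');

// Build output file paths from the input file's base name, dropping the
// directory part and the ".pdf" extension.
// (Hypothetical helper, written for this guide.)
function buildOutputPaths(inputPdf, outputDir) {
    const baseName = path.basename(inputPdf, path.extname(inputPdf));
    return {
        taggedPdf: path.join(outputDir, baseName + '-tagged-pdf.pdf'),
        taggingReport: path.join(outputDir, baseName + '-tagging-report.xlsx')
    };
}

const paths = buildOutputPaths(
    './Adobe Accessibility Auto-Tag API Sample.pdf',
    './output/AutotagPDF/'
);
console.log(paths.taggedPdf);
console.log(paths.taggingReport);
```

The returned taggedPdf and taggingReport values could stand in for TAGGED_PDF and TAGGING_REPORT if you adopt this approach.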

3) Next, we set up the SDK to use our credentials.

const credentials = PDFServicesSdk.Credentials
        .serviceAccountCredentialsBuilder()
        .fromFile('pdfservices-api-credentials.json')
        .build();

// Create an ExecutionContext using credentials
const executionContext = PDFServicesSdk.ExecutionContext.create(credentials);

This code points to the credentials downloaded previously and sets up an execution context object that will be used later.

4) Now, let's create the operation:

// Create a new operation instance.
const autotagPDFOperation = PDFServicesSdk.AutotagPDF.Operation.createNew(),
    input = PDFServicesSdk.FileRef.createFromLocalFile(INPUT_PDF);

// Build autotagPDF options
const autotagPDFOptions = new PDFServicesSdk.AutotagPDF.options.AutotagPDFOptions.Builder()
    .shiftHeadings()
    .generateReport()
    .build();
autotagPDFOperation.setInput(input);
autotagPDFOperation.setOptions(autotagPDFOptions);

This code defines what we're doing (an Auto-Tag operation), points to our local PDF file as the input, and then defines options for the Auto-Tag call. PDF Accessibility Auto-Tag API has a few different options; in this example, shiftHeadings() asks for headings to be shifted in the tagged output and generateReport() asks for an XLSX report, so the call returns both the tagged PDF document and the report.

5) The next code block executes the operation:

// Execute the operation
autotagPDFOperation.execute(executionContext)
    .then(result => Promise.all([
        // Wait for both files to be written before reporting success.
        result.taggedPDF.saveAsFile(TAGGED_PDF),
        result.report.saveAsFile(TAGGING_REPORT)
    ]))
    .then(() => {
        console.log('Successfully tagged information in PDF.');
    })
    .catch(err => console.log(err));
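
The same save-then-report flow can also be written with async/await. The sketch below assumes that saveAsFile returns a promise, and it uses a stand-in object in place of the real SDK result so that it runs without credentials; saveResults and createStubResult are hypothetical names introduced only for this illustration.

```javascript
// Save both outputs and resolve only when every save has finished.
// `result` is expected to expose `taggedPDF` and `report`, each with a
// saveAsFile(path) method, like the object handled after execute().
// (Hypothetical helper; assumes saveAsFile returns a promise.)
async function saveResults(result, taggedPath, reportPath) {
    await Promise.all([
        result.taggedPDF.saveAsFile(taggedPath),
        result.report.saveAsFile(reportPath)
    ]);
    return [taggedPath, reportPath];
}

// Stand-in for the SDK result so the sketch runs without credentials.
// It records each destination instead of writing a file.
function createStubResult(savedPaths) {
    const stubFile = {
        saveAsFile: async destination => { savedPaths.push(destination); }
    };
    return { taggedPDF: stubFile, report: stubFile };
}

const savedPaths = [];
saveResults(createStubResult(savedPaths), 'tagged.pdf', 'report.xlsx')
    .then(saved => console.log('Saved: ' + saved.join(', ')))
    .catch(err => {
        console.error(err);
        // Signal failure to the shell without cutting off pending output.
        process.exitCode = 1;
    });
```

In the real application, the object passed to saveResults would be the result of autotagPDFOperation.execute(executionContext) rather than the stub.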

To run the application, type node autotag-pdf.js at the command line.

Here's the complete application (autotag-pdf.js):

const PDFServicesSdk = require('@adobe/pdfservices-node-sdk');
const fs = require('fs');

const INPUT_PDF = './Adobe Accessibility Auto-Tag API Sample.pdf';
const OUTPUT_PATH = './output/AutotagPDF/';

// Remove the output directory if it already exists.
// fs.rmSync requires Node.js 14.14.0 or later.
if (fs.existsSync(OUTPUT_PATH)) fs.rmSync(OUTPUT_PATH, { recursive: true });

const TAGGED_PDF = OUTPUT_PATH + INPUT_PDF + "-tagged-pdf.pdf";
const TAGGING_REPORT = OUTPUT_PATH + INPUT_PDF + "-tagging-report.xlsx";

const credentials = PDFServicesSdk.Credentials
        .serviceAccountCredentialsBuilder()
        .fromFile('pdfservices-api-credentials.json')
        .build();

// Create an ExecutionContext using credentials
const executionContext = PDFServicesSdk.ExecutionContext.create(credentials);

// Create a new operation instance.
const autotagPDFOperation = PDFServicesSdk.AutotagPDF.Operation.createNew(),
    input = PDFServicesSdk.FileRef.createFromLocalFile(INPUT_PDF);

// Build autotagPDF options
const autotagPDFOptions = new PDFServicesSdk.AutotagPDF.options.AutotagPDFOptions.Builder()
    .shiftHeadings()
    .generateReport()
    .build();
autotagPDFOperation.setInput(input);
autotagPDFOperation.setOptions(autotagPDFOptions);

// Execute the operation
autotagPDFOperation.execute(executionContext)
    .then(result => Promise.all([
        // Wait for both files to be written before reporting success.
        result.taggedPDF.saveAsFile(TAGGED_PDF),
        result.report.saveAsFile(TAGGING_REPORT)
    ]))
    .then(() => {
        console.log('Successfully tagged information in PDF.');
    })
    .catch(err => console.log(err));

Next Steps

Now that you've successfully performed your first operation, review the documentation for many other examples and reach out on our forums with any questions. Also remember that the samples you downloaded while creating your credentials include many more demos.

Last updated 7/19/2023
