PDF Q&A
This article describes a PDF Q&A implementation using Spring AI. Given a PDF file, the sample application loads its content into a vector store, then uses an LLM to answer users' questions based on that content.
The complete source is available on GitHub JavaAIDev/pdf-qa.
PDF Q&A is a classic example of RAG (retrieval-augmented generation): the content of PDF files provides the context an LLM uses to answer queries.
Prerequisites
- Java 21
- A vector database; the sample uses pgvector. Use the provided Docker Compose file to start it.
- Ollama to run local models, or an OpenAI API key.
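The repository ships its own Docker Compose file for pgvector. A minimal equivalent, matching the datasource settings used later in this article (user, password, and database all postgres, port 5432), might look like this; the image tag is an assumption:

```yaml
# Hypothetical compose.yaml; the sample repo contains the actual file
services:
  pgvector:
    image: pgvector/pgvector:pg16
    environment:
      POSTGRES_USER: postgres
      POSTGRES_PASSWORD: postgres
      POSTGRES_DB: postgres
    ports:
      - "5432:5432"
```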
Load PDF
The first step is to load the content of the PDF file into the vector store. Spring AI provides PagePdfDocumentReader to read the content of PDF files. Its read method returns a List<Document>. This list of Documents is then passed to a TokenTextSplitter to split into chunks, and the chunks are saved to the vector store.
PDFContentLoader is a CommandLineRunner, so it imports the PDF content after the application starts. To avoid duplicated content, a marker file is created after the first successful import; subsequent imports are skipped.
package com.javaaidev.pdfqa;

import java.nio.file.Files;
import java.nio.file.Path;
import org.slf4j.Logger;
import org.slf4j.LoggerFactory;
import org.springframework.ai.reader.pdf.PagePdfDocumentReader;
import org.springframework.ai.transformer.splitter.TokenTextSplitter;
import org.springframework.ai.vectorstore.VectorStore;
import org.springframework.boot.CommandLineRunner;
import org.springframework.core.io.FileSystemResource;
import org.springframework.stereotype.Component;

// Registered as a Spring bean so the CommandLineRunner is invoked on startup
@Component
public class PDFContentLoader implements CommandLineRunner {

  private static final Logger LOGGER = LoggerFactory.getLogger(PDFContentLoader.class);

  private final VectorStore vectorStore;

  public PDFContentLoader(VectorStore vectorStore) {
    this.vectorStore = vectorStore;
  }

  public void load(Path pdfFilePath) {
    LOGGER.info("Load PDF file {}", pdfFilePath);
    var reader = new PagePdfDocumentReader(new FileSystemResource(pdfFilePath));
    var splitter = new TokenTextSplitter();
    var docs = splitter.split(reader.read());
    vectorStore.add(docs);
    LOGGER.info("Loaded {} docs", docs.size());
  }

  @Override
  public void run(String... args) throws Exception {
    var markerFile = Path.of(".", ".pdf-imported");
    if (Files.exists(markerFile)) {
      LOGGER.info("Marker file {} exists, skip. Delete this file to re-import.", markerFile);
      return;
    }
    load(Path.of(".", "content", "Understanding_Climate_Change.pdf"));
    Files.createFile(markerFile);
  }
}
Q&A
Implementing question answering is quite simple with Spring AI, which provides a built-in advisor, QuestionAnswerAdvisor. All we need to do is include this advisor when sending requests, which can be done with the defaultAdvisors method of ChatClient.Builder. Here I also add a SimpleLoggerAdvisor for logging.
package com.javaaidev.pdfqa;

import org.springframework.ai.chat.client.ChatClient;
import org.springframework.ai.chat.client.advisor.QuestionAnswerAdvisor;
import org.springframework.ai.chat.client.advisor.SimpleLoggerAdvisor;
import org.springframework.web.bind.annotation.PostMapping;
import org.springframework.web.bind.annotation.RequestBody;
import org.springframework.web.bind.annotation.RestController;

@RestController
public class QaController {

  private final ChatClient chatClient;

  public QaController(ChatClient.Builder builder,
      QuestionAnswerAdvisor questionAnswerAdvisor,
      SimpleLoggerAdvisor simpleLoggerAdvisor) {
    this.chatClient = builder.defaultAdvisors(
        questionAnswerAdvisor,
        simpleLoggerAdvisor).build();
  }

  @PostMapping("/qa")
  public QaResponse qa(@RequestBody QaRequest request) {
    return new QaResponse(chatClient.prompt().user(request.input()).call().content());
  }

  public record QaRequest(String input) {
  }

  public record QaResponse(String output) {
  }
}
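The controller above injects the two advisors as beans, so they must be defined somewhere in the application context. The actual bean definitions are in the sample repository; a minimal sketch might look like the following. Note the constructor call is an assumption: QuestionAnswerAdvisor takes the VectorStore it retrieves context from, and newer Spring AI versions also provide a builder for it.

```java
package com.javaaidev.pdfqa;

import org.springframework.ai.chat.client.advisor.QuestionAnswerAdvisor;
import org.springframework.ai.chat.client.advisor.SimpleLoggerAdvisor;
import org.springframework.ai.vectorstore.VectorStore;
import org.springframework.context.annotation.Bean;
import org.springframework.context.annotation.Configuration;

// Hypothetical configuration class; the sample repo contains the actual bean definitions
@Configuration
public class AdvisorConfiguration {

  // QuestionAnswerAdvisor retrieves relevant chunks from the vector store
  // and adds them as context to the user prompt
  @Bean
  public QuestionAnswerAdvisor questionAnswerAdvisor(VectorStore vectorStore) {
    return new QuestionAnswerAdvisor(vectorStore);
  }

  // SimpleLoggerAdvisor logs chat requests and responses at DEBUG level
  @Bean
  public SimpleLoggerAdvisor simpleLoggerAdvisor() {
    return new SimpleLoggerAdvisor();
  }
}
```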
Ollama or OpenAI
This sample application uses Ollama by default. We can switch to OpenAI by activating the openai Spring profile:
-Dspring.profiles.active=openai
Dependencies for both Ollama and OpenAI are included in the Spring Boot project. In the default configuration file application.yaml, OpenAI is disabled.
spring:
  application:
    name: pdf-qa
  threads:
    virtual:
      enabled: true
  ai:
    ollama:
      chat:
        enabled: true
        options:
          model: "phi3"
          temperature: 0
      embedding:
        enabled: true
        options:
          model: "bge-large"
    openai:
      chat:
        enabled: false
      embedding:
        enabled: false
    vectorstore:
      pgvector:
        initializeSchema: true
  datasource:
    url: jdbc:postgresql://localhost:5432/postgres
    username: postgres
    password: postgres
logging:
  level:
    org.springframework.ai.chat.client.advisor.SimpleLoggerAdvisor: DEBUG
In the configuration for the openai profile, Ollama is disabled.
spring:
  ai:
    ollama:
      chat:
        enabled: false
      embedding:
        enabled: false
    openai:
      api-key: ${OPENAI_API_KEY:demo}
      chat:
        enabled: true
        options:
          model: gpt-4o
          temperature: 0.0
      embedding:
        enabled: true
        options:
          model: text-embedding-3-small
Test
Start the server and use Swagger UI to test the API, as shown in the screenshot below.
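Alternatively, the endpoint can be called directly. A minimal sketch using Java's built-in HTTP client follows; it assumes the server listens on the default port 8080, and the JSON body matches the QaRequest record shown earlier.

```java
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;

public class QaClient {

  // Build the JSON body matching the QaRequest record: {"input": "..."}
  static String buildQaRequest(String input) {
    return "{\"input\": \"" + input.replace("\"", "\\\"") + "\"}";
  }

  public static void main(String[] args) throws Exception {
    // Assumes the application runs locally on the Spring Boot default port 8080
    var request = HttpRequest.newBuilder()
        .uri(URI.create("http://localhost:8080/qa"))
        .header("Content-Type", "application/json")
        .POST(HttpRequest.BodyPublishers.ofString(
            buildQaRequest("What are the main causes of climate change?")))
        .build();
    var response = HttpClient.newHttpClient()
        .send(request, HttpResponse.BodyHandlers.ofString());
    // Prints the QaResponse JSON, e.g. {"output": "..."}
    System.out.println(response.body());
  }
}
```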