Today, we move from "Building" to "Proving."
Testing a RAG system isn't just about checking for "Success" or "Failure." It’s about Grounding Truth. We need to ensure that when a warehouse manager asks about a "Wave," the AI isn't hallucinating a seaside vacation—it’s pulling directly from page 42 of your Oracle Warehouse Management Cloud User Guide.When a user asks a question like, "How do I close a manifest for an LTL shipment?", the consumption loop completes:
- Search: Select AI converts the user's question into a vector.
- Retrieve: It searches the "Meaning Map" for the top 3-5 chunks from the WMS manual that are most similar to the question.
- Augment: It sends the question PLUS those specific chunks to the LLM.
- Narrate: The LLM uses that "grounded" context to give you an answer that is 100% based on the WMS Guide, not its own imagination.
The Pre-Flight Check: Is the Engine Ready?
Before you issue your first natural language prompt, you must verify that the "Ingestion Pipeline" finished its heavy lifting. PDF processing is asynchronous; the database needs time to chunk the text and generate vectors.1. The Health Dashboard
VECTOR_$<INDEX_ID>). While the system manages this for you, manually providing a vector_table_name offers several architectural advantages:STATUS_TABLE column in the USER_CLOUD_PIPELINES view identifies an internal tracking table created by the database to manage a specific pipeline's operations.When you create a Vector Index using
DBMS_CLOUD_AI.CREATE_VECTOR_INDEX, Oracle automatically launches a background pipeline to handle the heavy lifting (downloading files, chunking text, and generating embeddings). The STATUS_TABLE is where the database records the granular progress of every single file in your Object Storage location.To truly stress-test our RAG implementation, we don't just ask simple questions. We test for Context, Process, and Synthesis. Here is the testing suite I use for our WMS pilot:
Level 1: Semantic Recall (The Definition)
Prompt:
SELECT AI narrate what is an 'LPN' and why is it central to WMS?What we’re testing: Can the AI find a specific definition buried in a 500-page doc?
Success Criteria: The response should mention "License Plate Number" and its role in tracking containers/pallets.
Result :
"An 'LPN' in the context of Warehouse Management System (WMS)
stands for License Plate Number. It is a unique barcode assigned to trailers
or containers to trace them within the warehouse. The LPN is central to WMS
because it helps in managing inventory within a Distribution Center by
tracking the movement of goods from receipt through shipment. By using LPNs,
the system can easily identify and manage the contents of containers or
pallets without the need to scan detailed information during processes like
picking or receiving. - oracle-warehouse-management-cloud-user-guide.pdf (https://objectstorage.us-phoenix-1.oraclecloud.com/nxxxxxx/b/WMS_PDF_BUCKET/o/oracle-warehouse-management-cloud-user-guide.pdf)" |
Level 2: The Process Chain (Step-by-Step)
Prompt:
SELECT AI narrate describe the step-by-step process for performing a 'Blind Cycle Count'.What we’re testing: Can the AI retrieve and sequence multiple "chunks" of text correctly?
Success Criteria: A numbered list that matches the manual's operational flow.
Result
|
"To perform a Blind Cycle Count in Oracle WMS Cloud, follow
these step-by-step instructions: 2. Scan the location that needs to be cycle counted. 3. Scan each LPN present in the location. 4. If you accidentally scan an LPN that does not exist in the
location, the system will prompt a message to "Rescan the same
LPN." Rescanning the same LPN will allow the system to make the
necessary inventory adjustments to record the new LPN in the location. -
oracle-warehouse-management-cloud-user-guide.pdf
(https://objectstorage.us-phoenix-1.oraclecloud.com/n/xxxxxx/b/WMS_PDF_BUCKET/o/oracle-warehouse-management-cloud-user-guide.pdf)" |
Level 3: Troubleshooting (Applied Logic)
Prompt:
SELECT AI narrate a shipment is stuck in 'In-Transit' status even though it has arrived. What should I check?What we’re testing: Does the RAG retrieve troubleshooting tips rather than just definitions?
Success Criteria: The AI provides 2-3 specific "checkpoints" mentioned in the manual.
Result
"To address the issue of a shipment being stuck in 'In-Transit'
status even though it has arrived, you should check the following: 1. Ensure that the ASN (Advanced Shipment Notification) has been
fully received and verified. The ASN status should be updated to 'Receiving
Complete' or 'Verified' after the receipt confirmation is generated and sent. - oracle-warehouse-management-cloud-user-guide.pdf (https://objectstorage.us-phoenix-1.oraclecloud.com/n/xxxxxx/b/WMS_PDF_BUCKET/o/oracle-warehouse-management-cloud-user-guide.pdf)" |
Level 4: The Audit Trail (Citations)
Prompt:
SELECT AI narrate what are the security requirements for RF device users? Provide citations.What we’re testing: Is the AI properly referencing its source?
Success Criteria: The response includes "Sources: [File Name]" or ideally specific page/section references.
Result
|
"The security requirements for RF device users include creating
new users through the "Users" screen where you can define the
username, password, facilities and companies access, user's role, default
group, language, default printer, and fixed equipment type. Additionally,
users must ensure that the RF gun is in the correct facility and can change
facilities by pressing Ctrl-F and typing in the Facility code. Sources: - oracle-warehouse-management-cloud-user-guide.pdf (https://objectstorage.us-phoenix-1.oraclecloud.com/nxxxxxx/b/WMS_PDF_BUCKET/o/oracle-warehouse-management-cloud-user-guide.pdf)" |
We have tested Same Prompt from on APEX UI ( tis is deigned by Oracle and available in Oracle Cloud) and outcome was wonderful as expected
Lessons from the Lab: What We Discovered
During our testing phase, we learned three critical lessons that every enterprise architect should keep in their back pocket:
1. The "Narrate" Action is Key
Standard SELECT AI might try to generate SQL. When testing RAG on documents, always use the narrate keyword. This tells the database: "Don't write code; tell me a story based on the retrieved facts."
2. Guarding against "Out-of-Bound" Answers
If the AI answers a question that isn't in your PDF, it’s using its "base knowledge" (hallucinating). To fix this, we refined our AI Profile attributes to increase the "grounding" requirement.
Conclusion: From Test to Production
Testing Select AI with RAG isn't just a technical exercise—it's about Building Trust. When Sarah the Lead Support Engineer (from our first blog) sees a response that perfectly mirrors the technical manual, she stops being a "searcher" and starts being an "expert."
By using the DBMS_CLOUD_AI package and monitoring our vector indexes, we’ve proven that the database isn't just a place to store data anymore. It’s a place to understand it.