DocDigitizer PowerShell Module

Commands Reference

vailable Commands

Command Description
Send-DocDigitizerDocument Send a PDF document for processing
Test-DocDigitizerConnection Test API connectivity
Get-DocDigitizerConfig View current configuration
Set-DocDigitizerConfig Set default configuration values
Get-DocDigitizerHelp Show help and usage examples

Send-DocDigitizerDocument

Sends a PDF document to the DocDigitizer API for OCR, classification, and data extraction.

Syntax

Send-DocDigitizerDocument
    -FilePath 
    [-DocumentId ]
    [-ContextId ]
    [-Pipeline ]
    [-LogLevel ]
    [-BaseUrl ]
    [-TimeoutSec ]
    [-SaveExtraction]
    [-OutputPath ]
    [-Depth ]

Parameters

Parameter Required Description
-FilePath Yes Path to the PDF file to process. Accepts pipeline input.
-DocumentId No Unique identifier for the document (GUID). Auto-generated if not provided.
-ContextId No Context identifier for grouping related documents (GUID). Auto-generated if not provided.
-Pipeline No Name of the pipeline to execute (e.g., MainPipelineWithOCR, MainPipelineWithFile).
-LogLevel No Response verbosity: Minimal, Medium, or Full.
-BaseUrl No Override the default API URL.
-TimeoutSec No Request timeout in seconds. Default: 300.
-SaveExtraction No Save result to {filename}_extraction.json in the same directory.
-OutputPath No Custom path to save the JSON result.
-Depth No JSON serialization depth. Default: 20.

Output

Returns a JSON string containing:

{
  "traceId": "ABC1234",
  "state": "PROCESSINGX",
  "pipeline": "MainPipelineWithOCR",
  "pageCount": 3,
  "filePath": "C:\\docs\\invoice.pdf",
  "documentId": "a1b2c3d4-...",
  "contextId": "e5f6g7h8-...",
  "output": {
    "extractions": [...]
  },
  "timers": {
    "DocIngester_Total": 1234.56,
    "DocWorker_Total": 1200.00
  }
}

Test-DocDigitizerConnection

Tests the connection to the DocDigitizer API.

Syntax

Test-DocDigitizerConnection
    [-BaseUrl ]
    [-Quiet]

Parameters

Parameter Required Description
-BaseUrl No Override the default API URL.
-Quiet No Returns only $true or $false instead of detailed status.

Output

Url       : https://apix.docdigitizer.com/sync
Connected : True
Response  : I am alive
Error     :
Latency   : 45

Get-DocDigitizerConfig

Displays the current configuration values for the DocDigitizer module.

Syntax

Get-DocDigitizerConfig

Output

BaseUrl         : https://apix.docdigitizer.com/sync
BaseUrlSource   : Default
Pipeline        : (not set - will use server default)
PipelineSource  : Default
LogLevel        : (not set - will use Minimal)
LogLevelSource  : Default
Timeout         : 300
TimeoutSource   : Default

Set-DocDigitizerConfig

Sets default configuration values for the DocDigitizer module.

Syntax

Set-DocDigitizerConfig
    [-ApiKey ]
    [-BaseUrl ]
    [-Pipeline ]
    [-LogLevel ]
    [-Timeout ]
    [-Persist]

Parameters

Parameter Required Description
-ApiKey No Your DocDigitizer API key. Get one at docdigitizer.com/contact
-BaseUrl No Default base URL for the API.
-Pipeline No Default pipeline name.
-LogLevel No Default log level: Minimal, Medium, or Full.
-Timeout No Default timeout in seconds (10-3600).
-Persist No Save configuration to PowerShell profile for future sessions.

Get-DocDigitizerHelp

Displays help information and usage examples for the module.

Syntax

Get-DocDigitizerHelp