
Document Splitting and Page Extraction with C#

Wordize for .NET is a professional solution for splitting documents into parts programmatically. The Splitter class lets C# developers divide documents into pages, sections, or page ranges according to a chosen criterion. Programmatic splitting saves time when processing large files, simplifies collaborative work on documents, and streamlines workflows for printing and distribution.

When splitting documents into parts, the API automatically generates unique names for each part following the {filename}_{index}.{extension} pattern, where {filename} is the base output file name, {index} is the sequential part number starting from 1, and {extension} is the output file format extension.
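
The naming pattern above can be sketched with plain .NET path helpers. This is only an illustration of the {filename}_{index}.{extension} rule, not the library's own code; the PartName function is a hypothetical helper introduced for this example.

```csharp
using System;
using System.IO;

// Hypothetical helper (not part of Wordize): builds the name of part number
// `index` from the base output file name, following {filename}_{index}.{extension}.
static string PartName(string baseOutputFile, int index)
{
    string directory = Path.GetDirectoryName(baseOutputFile) ?? "";
    string name = Path.GetFileNameWithoutExtension(baseOutputFile);
    string extension = Path.GetExtension(baseOutputFile); // includes the leading dot
    return Path.Combine(directory, $"{name}_{index}{extension}");
}

// Part numbering starts from 1, as described above.
for (int i = 1; i <= 3; i++)
    Console.WriteLine(PartName("Output_page.docx", i));
```

For the base name Output_page.docx, the helper yields Output_page_1.docx, Output_page_2.docx, and Output_page_3.docx.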

Key Features:
  • Split by pages - Splitter.Split() with SplitCriteria.Page creates a separate document for each page of the source file
  • Split by sections - Splitter.Split() with SplitCriteria.SectionBreak divides the document at section boundaries
  • Split by heading styles - SplitCriteria.Style together with a specified SplitStyle splits the document at paragraphs that use that heading style
  • Extract page ranges - Splitter.ExtractPages() extracts individual pages or page ranges from a document
  • Remove blank pages - Splitter.RemoveBlankPages() removes empty pages to improve readability and reduce file size
  • Fluent API - SplitterContext and method chaining let you configure splitting operations in a single, readable C# expression
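
The features listed above might be used roughly as follows. This is a hedged sketch: the method and option names come from the list, but the exact overloads are assumptions here, namely that ExtractPages takes a zero-based start page index followed by a page count, and that RemoveBlankPages takes an input and an output file name. The "Heading 1" style name and all file names are placeholders. Check the API reference before relying on these signatures.

```csharp
using Wordize.Splitting;

// Split at every paragraph formatted with the "Heading 1" style
// (the style name is a placeholder for whatever your document uses).
Splitter.Split("Input.docx", "Output_heading.docx",
    new SplitOptions
    {
        SplitCriteria = SplitCriteria.Style,
        SplitStyle = "Heading 1"
    });

// ASSUMED overload: (input, output, startPageIndex, pageCount),
// with a zero-based start index. Here: two pages starting at the second page.
Splitter.ExtractPages("Input.docx", "Output_pages.docx", 1, 2);

// ASSUMED overload: (input, output). Writes a copy without blank pages.
Splitter.RemoveBlankPages("Input.docx", "Output_clean.docx");
```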

Use Cases:
  • Page extraction for selective processing
  • Document preparation for printing
  • Collaborative document work
  • Data archiving and restructuring
  • Content preparation for web publishing
  • Removal of blank pages and unnecessary content

Test the programmatic document splitting capabilities using the interactive online demo presented on this page. Upload your document, select the splitting criteria, run the operation, and download the resulting parts. The provided C# code snippet using the Splitter class is ready for use in your .NET project.

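using Wordize.Splitting;

// Split Input.docx into one document per page.
// Parts are named Output_page_1.docx, Output_page_2.docx, and so on.
Splitter.Split("Input.docx", "Output_page.docx",
    new SplitOptions()
    {
        SplitCriteria = SplitCriteria.Page
    }
);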
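using Wordize.Splitting;

// Split Input.docx at each section break.
// Parts are named Output_section_1.docx, Output_section_2.docx, and so on.
Splitter.Split("Input.docx", "Output_section.docx",
    new SplitOptions()
    {
        SplitCriteria = SplitCriteria.SectionBreak
    }
);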

How to programmatically split PDF, Word, HTML into parts

  1. Add a reference to Wordize for .NET in your C# project
  2. Call the Splitter.Split() method, passing the source document, the base output file name, and a SplitOptions object that specifies the splitting criteria. Input and output formats are determined automatically from the file extensions
  3. Collect the resulting parts, which are named according to the {filename}_{index}.{extension} pattern
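
The steps above can be sketched as a short program that splits a Word document into per-page PDF files and then lists the parts. It assumes that Input.docx exists in the working directory and that the parts are written next to the base output name; both file names are placeholders.

```csharp
using System;
using System.IO;
using Wordize.Splitting;

// Step 2: split by pages. The .pdf extension on the base output name
// selects PDF as the output format for every part.
Splitter.Split("Input.docx", "Output.pdf",
    new SplitOptions { SplitCriteria = SplitCriteria.Page });

// Step 3: collect the parts, expected as Output_1.pdf, Output_2.pdf, ...
// (assumes relative output paths resolve to the current directory;
// the order returned by GetFiles is not guaranteed).
string[] parts = Directory.GetFiles(Directory.GetCurrentDirectory(), "Output_*.pdf");
foreach (string part in parts)
    Console.WriteLine(Path.GetFileName(part));
```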