使用OLE从Powerpoint中获取文本

时间:2010-09-09 16:44:47

标签: perl powerpoint ole

我正在尝试使用Win32::OLE从当前演示文稿中获取幻灯片及其标题列表。

到目前为止,我可以

    my $powerpoint = Win32::OLE->GetActiveObject('Powerpoint.Application')
    my $ap = $$powerpoint { ActivePresentation } ;
    my $slides = $$ap { slides } ;

$slides只有Application Count Parent属性 任何人都可以指出我更进一步。

我意识到一个明显的答案就是不要使用Powerpoint。公司的dictat等等。

1 个答案:

答案 0 :(得分:5)

另请参阅我对Automating a Job at Work: Importing Powerpoint Bullet Text into an Excel Sheet的回答。

PowerPoint幻灯片没有特定的Title属性。它们具有Name属性,但这不是一回事。形状的占位符类型属性可以告诉您它是否是标题:

#!/usr/bin/perl

use strict; use warnings;
use Try::Tiny;
use Win32::OLE;
use Win32::OLE::Const qw( Microsoft.PowerPoint );
use Win32::OLE::Enum;

$Win32::OLE::Warn = 3;

my $ppt = get_ppt();

my $presentation = $ppt->Presentations->Open('test.ppt', 1);
my $slides = Win32::OLE::Enum->new( $presentation->Slides );

SLIDE:
while ( my $slide = $slides->Next ) {
    printf "%s:\t", $slide->Name;
    my $shapes = Win32::OLE::Enum->new( $slide->Shapes );
    SHAPE:
    while ( my $shape = $shapes->Next ) {
        my $type = $shape->PlaceholderFormat->Type;
        if ( $type == ppPlaceholderTitle
                or $type == ppPlaceholderCenterTitle
                or $type == ppPlaceholderVerticalTitle
        ) {
            print $shape->TextFrame->TextRange->text;
            last SHAPE;
        }
    }
    print "\n";
}

$presentation->Close;

sub get_ppt {
    my $ppt;

    try {
        $ppt = Win32::OLE->GetActiveObject('PowerPoint.Application');
    }
    catch {
        die $_;
    };

    unless ( $ppt ) {
        $ppt = Win32::OLE->new(
            'PowerPoint.Application', sub { $_[0]->Quit }
        ) or die sprintf(
            'Cannot start PowerPoint: %s', Win32::OLE->LastError
        );
    }

    return $ppt;
}

输出:

Slide1: Title Page Title
Slide2: Page with bullets
Slide3: Page with chart
Slide4:

显然,Slide4上没有标题。