Jump to content

Search the Community

Showing results for tags 'inline image text'.

  • Search By Tags

    Type tags separated by commas.
  • Search By Author

Content Type


Forums

  • General
    • Announcements and Site News
    • Administration
  • AutoIt v3
    • AutoIt Help and Support
    • AutoIt Technical Discussion
    • AutoIt Example Scripts
  • Scripting and Development
    • Developer General Discussion
    • Language Specific Discussion
  • IT Administration
    • Operating System Deployment
    • Windows Client
    • Windows Server
    • Office

Categories

  • AutoIt Team
    • Beta
    • MVP
  • AutoIt
    • Automation
    • Databases and web connections
    • Data compression
    • Encryption and hash
    • Games
    • GUI Additions
    • Hardware
    • Information gathering
    • Internet protocol suite
    • Maths
    • Media
    • PDF
    • Security
    • Social Media and other Website API
    • Windows
  • Scripting and Development
  • IT Administration
    • Operating System Deployment
    • Windows Client
    • Windows Server
    • Office

Find results in...

Find results that contain...


Date Created

  • Start

    End


Last Updated

  • Start

    End


Filter by number of...

Joined

  • Start

    End


Group


Member Title


Location


WWW


Interests

Found 1 result

  1. Hi, I used to batch convert word files (see attached file) to html and it worked fine but recently they've started putting some title page information in a different way. I use a RegExp after conversion to capture the text associated with, I believe are, inline images using: <img.*?001.*?002.*?alt="(.*?)"><img.*?(?=</p>) but this doesn't always work well and because these files come in large batches I'd like to get the text displayed in the html during the Word to html conversion stage is this possible or I'm I barking up the wrong tree? This is the information on the front page I wish to capture in the html: 2019] UKFTT 0717 (TC) TC07488 CAPITAL GAINS TAX – Contracts for land and properties to be built in Barbados – Properties not built – Disposal of contractual rights – Whether payments made under contracts prior to disposal of rights gave rise to losses for CGT purposes – Appeal allowed Thanks for any help/hints. here is a portion of my conversion script (thanks to those such as Water who've helped me with examples in the past) $oDoc = _Word_DocOpen($oWord, $processing & $sFileName) ;save file as: Select Case $convert2 = 1 Local $i_Format = 10 _Word_DocSaveAs($oDoc, $out & $fn & ".html", $i_Format) If @error <> 0 Then Exit MsgBox($MB_SYSTEMMODAL, "Word UDF: _Word_DocSaveAs Example", _ "Error saving the Word document." & @CRLF & "@error = " & @error & ", @extended = " & @extended) Case $convert2 = 2 Local $i_Format = 6 _Word_DocSaveAs($oDoc, $out & $fn & ".rtf", $i_Format) Case $convert2 = 3 Local $i_Format = 17 _Word_DocSaveAs($oDoc, $out & $fn & ".pdf", $i_Format) Case $convert2 = 4 Local $i_Format = 10 _Word_DocSaveAs($oDoc, $out & $fn & ".html", $i_Format) Local $i_Format = 6 _Word_DocSaveAs($oDoc, $out & $fn & ".rtf", $i_Format) Case $convert2 = 5 Local $i_Format = 10 _Word_DocSaveAs($oDoc, $out & $fn & ".html", $i_Format) Local $i_Format = 17 $iFileExists = FileExists($out & $fn & ".pdf") If $iFileExists Then ConsoleWrite("PDF exists: " & $out & $fn & ".pdf" & @CRLF) Else _Word_DocSaveAs($oDoc, $out & $fn & ".pdf", $i_Format) EndIf Case $convert2 = 6 Local $i_Format = 10 _Word_DocSaveAs($oDoc, $out & $fn & ".html", $i_Format) Local $i_Format = 6 _Word_DocSaveAs($oDoc, $out & $fn & ".rtf", $i_Format) Local $i_Format = 17 _Word_DocSaveAs($oDoc, $out & $fn & ".pdf", $i_Format) EndSelect _Word_DocClose($oDoc, $WdSaveChanges) TC07488.docx
×
×
  • Create New...