01-30-2015, 11:26 PM | #1 |
Junior Member
Posts: 2
Karma: 10
Join Date: Jan 2015
Device: none
|
[Solved] How to extract webm videos from PDF
I'm really stuck here, trying to extract short webm video files from a PDF. I searched the entire day about it, trying different programs and methods, but nothing work.
With an EPUB is really easy, because I can open it with Calibre, and a temp folder is created with all the videos inside, so I tried to convert the PDF to EPUB with Calibre and then open it, but the conversion has no videos, just a lot of images with a couple of pixels inside (and it was REALLY slow, like 2 hours to convert it). Tried converting to other formats, but was exactly the same, and really slow too. Is there a better way to do this? Or I just need to keep trying programs and conversions until I success (or not)? Last edited by JackSPk; 02-02-2015 at 08:05 PM. Reason: Solved |
01-31-2015, 01:27 AM | #2 |
Curmudgeon
Posts: 629
Karma: 1623086
Join Date: Jan 2012
Device: iPad, iPhone, Nook Simple Touch
|
Inside PDF? Eww.
I've never seen video embedded in a PDF, but based on what PDF files usually look like under the hood, I'd expect the video data to be in the form of a giant blob of base64-encoded data, in all likelihood, wrapped with some sort of magic commands to tell the reader that the blob contains video data. If you can find that blob, you can probably copy the base64-encoded blob to a text file, then use a base64-decoding tool to extract the raw video. Of course, whether anything can then process the video file or not depends on whether they leave in the appropriate header data when they stick it in the PDF file or replace that data with equivalent data somewhere else in the PDF file (in which case you'd have to reconstruct the header bits). I couldn't begin to guess. It's worth a shot, though. |
Advert | |
|
02-02-2015, 08:05 PM | #4 |
Junior Member
Posts: 2
Karma: 10
Join Date: Jan 2015
Device: none
|
Solved!
Thank you for your responses. Fortunately I didn't need to decrypt the pdf manually or something like that.
What I do was install Mac OS X Mavericks on VirtualBox, download File Juicer, download the PDF again and then extract all of it's content. It took a lot of time because running mac os in an AMD is really tedious, but worked, so :) PS: Just in case that someone want to know the video format, according to File Juicer, is MP4. |
12-30-2015, 06:22 AM | #5 | |
Groupie
Posts: 154
Karma: 2160280
Join Date: Jul 2009
Device: iPad1 iOS 5.1.1b, iPhone 4
|
Quote:
|
|
Advert | |
|
01-08-2016, 09:38 AM | #6 | |
Fuzzball, the purple cat
Posts: 1,283
Karma: 11087488
Join Date: Jun 2011
Location: California
Device: iPad
|
Quote:
|
|
Tags |
extract, pdf, video, webm |
Thread Tools | Search this Thread |
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Extract ISBN from PDF? | mdroberts | Calibre | 14 | 12-16-2016 08:32 AM |
Extract PDF text and store in custom column | diazlaz | Development | 2 | 12-30-2013 11:00 PM |
WebM: The Truly Open Video Format is here! | Sonist | News | 18 | 05-22-2010 05:14 AM |
Extract PDF from Palm PDB-file? | Tobago | 1 | 02-18-2010 08:32 AM | |
[REQ] Extract the first PDF page as image | Format C: | 2 | 02-09-2009 11:53 AM |