Author Topic: Slightly OT: How to convert Japanese subtitles from idx/sub to srt  (Read 675 times)

Offline bbratzky

  • Member
  • Posts: 3
Hi, I have a problem that maybe you guys can answer for me.

I have a DVD of a stop-motion Czech film only released in Japan, so it only has Japanese subtitles. What I want to do is to capture the subtitles into an .srt file and then run the text through Google Translate to at least get a rough idea of what's going on.

I can get the subtitles into the idx/sub files, but from there, I'm having trouble getting them to srt. Using Subresync, I am asked to identify every character, but since I don't know Japanese, I don't know what to type.

I have also tried using Subrip to rip them into .bmp files, and then run them through an online OCR program. However, the character recognition seems pretty subpar, and there's something like 400 separate .bmp files for this film, so I'm not sure if this is the way to go either.

Do you have any advice? Is there any program out there that can convert idx/sub files into .srt files as Japanese characters? This is slightly off-topic since the particular film I'm currently trying to do this with is not anime (it's European animation), but this will come in handy for me in the future as I try ripping my collection of Japanese-released DVDs.

Thanks to anyone who can help!

Offline megido-rev.M

  • Member
  • Posts: 16121
Re: Slightly OT: How to convert Japanese subtitles from idx/sub to srt
« Reply #1 on: May 25, 2013, 12:05:34 AM »
What about just viewing raw text? Do the characters display as rectangles in notepad?

Offline xgraphy

  • Staff
  • Member
  • Posts: 1303
Re: Slightly OT: How to convert Japanese subtitles from idx/sub to srt
« Reply #2 on: May 25, 2013, 04:33:37 PM »
stop-motion Czech film only released in Japan
If you plan to upload these here, you are going to need to elaborate further on this; because your efforts are likely to end up in vein from this description.


For help please refer to the wiki and search the help forum!

PM is through my tracker profile only
 


Offline bbratzky

  • Member
  • Posts: 3
Re: Slightly OT: How to convert Japanese subtitles from idx/sub to srt
« Reply #4 on: May 25, 2013, 11:43:16 PM »
What about just viewing raw text? Do the characters display as rectangles in notepad?

Well, when converted to .sub/.idx, the .idx file is just metadata (the timing of the subtitles, the format, etc.), while the .sub file is not in plain text, so if you open it with Notepad, you see a bunch of gibberish. Likewise, if I play the movie in VLC with the subtitles showing, I can't just copy and paste the subtitles--that doesn't work. If there's a way just to rip the Japanese subtitles into plain text, please let me know. I'd love to do that!

If you plan to upload these here, you are going to need to elaborate further on this; because your efforts are likely to end up in vein from this description.

No, like I said in my original post, this is a Czech film, which is why it's "slightly OT". This particular film is solely for my own enjoyment, and I don't plan on uploading this film here. But  knowing how to do this process will come in handy as I try ripping some Japanese DVDs that I have which have only Japanese subtitles. So, no, this film won't be upped here, but as I learn how to do this, then I can share some relevant anime material with all of you :)

Offline Al_Sleeper

  • Member
  • Posts: 6586
Re: Slightly OT: How to convert Japanese subtitles from idx/sub to srt
« Reply #5 on: May 26, 2013, 12:50:42 AM »
.sub/.idx DVD subs are perfectly fine for BakaBT, especially Japanese subs.

Offline wizisi2k

  • Member
  • Posts: 150
Re: Slightly OT: How to convert Japanese subtitles from idx/sub to srt
« Reply #6 on: May 26, 2013, 12:52:29 AM »
to OCR them which is what you are basically asking requires the Japanese language pack installed, a solid knowledge of the kanji, a Japanese virtural keyboard (for OCR'ing said kanji), and patience.  I tried it once with Gundam Unicorn volume 1.  Because of my not knowing Japanese, I was screwed because I didn't know the kanji that was the first OCR character.  You could of course use romanji to type out the character but that would be a little more problematic when it comes to a TL.  I doubt .srt can suppourt the full kanji character set.  Either way, good luck.