Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Collapse
Brand Logo

agnos.is Forums

  1. Home
  2. Ask Lemmy
  3. Are there any tools I can use for translating a ~400 pages scanned book?

Are there any tools I can use for translating a ~400 pages scanned book?

Scheduled Pinned Locked Moved Ask Lemmy
18 Posts 9 Posters 0 Views
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • M This user is from outside of this forum
    M This user is from outside of this forum
    [email protected]
    wrote on last edited by [email protected]
    #1

    Situation: I got a scanned book that I'd like to read that is in chinese and has no available translation. I really want to read it, because it would probably help a lot with my university project.

    What I tried: tried creating a version with ocr to get a text layer and use some translation tool on it, but found no way to make the ocr text visible. I also tried this tool, but the ocr didn't work for me, and I found no way to use it with some local model

    Have any of you ever done a similar task? I'd appreciate any kind of suggestions and tips.

    B ? L burgerbaron@piefed.socialB G 6 Replies Last reply
    11
    • M [email protected]

      Situation: I got a scanned book that I'd like to read that is in chinese and has no available translation. I really want to read it, because it would probably help a lot with my university project.

      What I tried: tried creating a version with ocr to get a text layer and use some translation tool on it, but found no way to make the ocr text visible. I also tried this tool, but the ocr didn't work for me, and I found no way to use it with some local model

      Have any of you ever done a similar task? I'd appreciate any kind of suggestions and tips.

      B This user is from outside of this forum
      B This user is from outside of this forum
      [email protected]
      wrote on last edited by
      #2

      You can literally just feed the images into chat gpt at this point.

      T M M 3 Replies Last reply
      2
      • M [email protected]

        Situation: I got a scanned book that I'd like to read that is in chinese and has no available translation. I really want to read it, because it would probably help a lot with my university project.

        What I tried: tried creating a version with ocr to get a text layer and use some translation tool on it, but found no way to make the ocr text visible. I also tried this tool, but the ocr didn't work for me, and I found no way to use it with some local model

        Have any of you ever done a similar task? I'd appreciate any kind of suggestions and tips.

        ? Offline
        ? Offline
        Guest
        wrote on last edited by
        #3

        i did this with a chinese book, but have to check what i used.

        The translation was entirely readable.

        i think i used tesseract.

        No, GImagereader!

        that was it.

        tesseract was also very straightforward, but gimage reader had a GUI, and all I had to do was import the file and then click export and it did the whole thing.

        M 1 Reply Last reply
        6
        • M [email protected]

          Situation: I got a scanned book that I'd like to read that is in chinese and has no available translation. I really want to read it, because it would probably help a lot with my university project.

          What I tried: tried creating a version with ocr to get a text layer and use some translation tool on it, but found no way to make the ocr text visible. I also tried this tool, but the ocr didn't work for me, and I found no way to use it with some local model

          Have any of you ever done a similar task? I'd appreciate any kind of suggestions and tips.

          L This user is from outside of this forum
          L This user is from outside of this forum
          [email protected]
          wrote on last edited by
          #4

          notebooklm (Google)

          M 1 Reply Last reply
          1
          • B [email protected]

            You can literally just feed the images into chat gpt at this point.

            T This user is from outside of this forum
            T This user is from outside of this forum
            [email protected]
            wrote on last edited by
            #5

            This doesn't work after the pdf reaches a cert max size.

            B 1 Reply Last reply
            1
            • M [email protected]

              Situation: I got a scanned book that I'd like to read that is in chinese and has no available translation. I really want to read it, because it would probably help a lot with my university project.

              What I tried: tried creating a version with ocr to get a text layer and use some translation tool on it, but found no way to make the ocr text visible. I also tried this tool, but the ocr didn't work for me, and I found no way to use it with some local model

              Have any of you ever done a similar task? I'd appreciate any kind of suggestions and tips.

              burgerbaron@piefed.socialB This user is from outside of this forum
              burgerbaron@piefed.socialB This user is from outside of this forum
              [email protected]
              wrote on last edited by
              #6

              This is more intended for real time usage, but might work for you:

              https://github.com/Artikash/Textractor

              https://github.com/Crivella/ocr_translate

              I watch Macaw45 play full fledged Japanese retro RPG games using Textractor it'd probably be good for books too.

              M 1 Reply Last reply
              1
              • M [email protected]

                Situation: I got a scanned book that I'd like to read that is in chinese and has no available translation. I really want to read it, because it would probably help a lot with my university project.

                What I tried: tried creating a version with ocr to get a text layer and use some translation tool on it, but found no way to make the ocr text visible. I also tried this tool, but the ocr didn't work for me, and I found no way to use it with some local model

                Have any of you ever done a similar task? I'd appreciate any kind of suggestions and tips.

                G This user is from outside of this forum
                G This user is from outside of this forum
                [email protected]
                wrote on last edited by
                #7

                Which Google lens work? And take a picture of each page and feed it to the Google translate engine. It might be the easiest way.

                M 1 Reply Last reply
                0
                • T [email protected]

                  This doesn't work after the pdf reaches a cert max size.

                  B This user is from outside of this forum
                  B This user is from outside of this forum
                  [email protected]
                  wrote on last edited by
                  #8

                  Could just break it up into chapters or something, pretty easy to split a pdf.

                  1 Reply Last reply
                  0
                  • M [email protected]

                    Situation: I got a scanned book that I'd like to read that is in chinese and has no available translation. I really want to read it, because it would probably help a lot with my university project.

                    What I tried: tried creating a version with ocr to get a text layer and use some translation tool on it, but found no way to make the ocr text visible. I also tried this tool, but the ocr didn't work for me, and I found no way to use it with some local model

                    Have any of you ever done a similar task? I'd appreciate any kind of suggestions and tips.

                    andrew0@lemmy.dbzer0.comA This user is from outside of this forum
                    andrew0@lemmy.dbzer0.comA This user is from outside of this forum
                    [email protected]
                    wrote on last edited by
                    #9

                    If you find that OCR doesn't get you very far, maybe try a small vLM to parse PNGs of the pages. For example, Nanonets OCR will do this, although quite slow if you don't have a GPU. It will give you a Markdown version of the page, which you can then translate with another tool.

                    PaddleOCR might also be useful, since it focuses on Chinese, but it's more difficult to set up. To add to this, some other options are MinerU and MistralOCR (this is paid, but you can test it for free if you upload it in Mistral's library).

                    M 1 Reply Last reply
                    1
                    • B [email protected]

                      You can literally just feed the images into chat gpt at this point.

                      M This user is from outside of this forum
                      M This user is from outside of this forum
                      [email protected]
                      wrote on last edited by
                      #10

                      Every time I've done it, it's pretty bad. Ocr is much better.

                      1 Reply Last reply
                      0
                      • andrew0@lemmy.dbzer0.comA [email protected]

                        If you find that OCR doesn't get you very far, maybe try a small vLM to parse PNGs of the pages. For example, Nanonets OCR will do this, although quite slow if you don't have a GPU. It will give you a Markdown version of the page, which you can then translate with another tool.

                        PaddleOCR might also be useful, since it focuses on Chinese, but it's more difficult to set up. To add to this, some other options are MinerU and MistralOCR (this is paid, but you can test it for free if you upload it in Mistral's library).

                        M This user is from outside of this forum
                        M This user is from outside of this forum
                        [email protected]
                        wrote on last edited by
                        #11

                        That PaddleOCR looks very interesting. It will even extract images and formulas and somewhat preserve formatting in the output! I will try this one, even if takes more than a day to process is with my low end cpu. Thank you for the suggestion!

                        andrew0@lemmy.dbzer0.comA 1 Reply Last reply
                        1
                        • ? Guest

                          i did this with a chinese book, but have to check what i used.

                          The translation was entirely readable.

                          i think i used tesseract.

                          No, GImagereader!

                          that was it.

                          tesseract was also very straightforward, but gimage reader had a GUI, and all I had to do was import the file and then click export and it did the whole thing.

                          M This user is from outside of this forum
                          M This user is from outside of this forum
                          [email protected]
                          wrote on last edited by
                          #12

                          I used tesseract, but the output pdf didn't have visible text, and I found no way to change it. Maybe I don't know how to properly use it., or it's not intended to keep formatting.

                          ? 1 Reply Last reply
                          1
                          • burgerbaron@piefed.socialB [email protected]

                            This is more intended for real time usage, but might work for you:

                            https://github.com/Artikash/Textractor

                            https://github.com/Crivella/ocr_translate

                            I watch Macaw45 play full fledged Japanese retro RPG games using Textractor it'd probably be good for books too.

                            M This user is from outside of this forum
                            M This user is from outside of this forum
                            [email protected]
                            wrote on last edited by
                            #13

                            Thanks for the suggestions. That OCR_translate looks interesting. I will prioritize other recommended tools that seem to be more focused on books, but I bookmarked it for future needs.

                            1 Reply Last reply
                            1
                            • B [email protected]

                              You can literally just feed the images into chat gpt at this point.

                              M This user is from outside of this forum
                              M This user is from outside of this forum
                              [email protected]
                              wrote on last edited by
                              #14

                              I'm giving preference to open source tools, but that's a good thing to know, thanks

                              1 Reply Last reply
                              0
                              • L [email protected]

                                notebooklm (Google)

                                M This user is from outside of this forum
                                M This user is from outside of this forum
                                [email protected]
                                wrote on last edited by
                                #15

                                Well, I'm avoiding google, but I will keep it in mind as a last last resort, thanks

                                1 Reply Last reply
                                0
                                • G [email protected]

                                  Which Google lens work? And take a picture of each page and feed it to the Google translate engine. It might be the easiest way.

                                  M This user is from outside of this forum
                                  M This user is from outside of this forum
                                  [email protected]
                                  wrote on last edited by
                                  #16

                                  I'm not sure if it would be viable for a long book, and I'm also avoiding google, but thanks for helping. I got some nice suggestions in this thread.

                                  1 Reply Last reply
                                  0
                                  • M [email protected]

                                    I used tesseract, but the output pdf didn't have visible text, and I found no way to change it. Maybe I don't know how to properly use it., or it's not intended to keep formatting.

                                    ? Offline
                                    ? Offline
                                    Guest
                                    wrote on last edited by Guest
                                    #17

                                    I suggest the other program I mentioned instead, gImagereader.

                                    it's a frontend to tesseract and is more workable via its GUI and option menus.

                                    Load the file, execute the program.

                                    That's all I had to do for a successful OCR.

                                    1 Reply Last reply
                                    0
                                    • M [email protected]

                                      That PaddleOCR looks very interesting. It will even extract images and formulas and somewhat preserve formatting in the output! I will try this one, even if takes more than a day to process is with my low end cpu. Thank you for the suggestion!

                                      andrew0@lemmy.dbzer0.comA This user is from outside of this forum
                                      andrew0@lemmy.dbzer0.comA This user is from outside of this forum
                                      [email protected]
                                      wrote on last edited by
                                      #18

                                      Be wary that their docs are so and so. Nanonets OCR, Mistral OCR and MinerU will also extract formulas and images.

                                      One other model I forgot to mention is Docling. This one is quite quick to set up in a docker container, and will have a web interface ready to go where you can upload documents. This sort of follows the PaddleOCR pipeline, but also allows you to use vLMs.

                                      Good luck!

                                      1 Reply Last reply
                                      0
                                      Reply
                                      • Reply as topic
                                      Log in to reply
                                      • Oldest to Newest
                                      • Newest to Oldest
                                      • Most Votes


                                      • Login

                                      • Login or register to search.
                                      • First post
                                        Last post
                                      0
                                      • Categories
                                      • Recent
                                      • Tags
                                      • Popular
                                      • World
                                      • Users
                                      • Groups