How's it going?
Scanlation Thread
Other urls found in this thread:
webcomicgamma.takeshobo.co.jp
mangaupdates.com
mangaupdates.com
webcomicgamma.takeshobo.co.jp
webcomicgamma.takeshobo.co.jp
webcomicgamma.takeshobo.co.jp
pastebin.com
pastebin.com
pastebin.com
cors-anywhere.herokuapp.com
pastebin.com
pastebin.com
pastebin.com
pastebin.com
comip.jp
pastebin.com
pastebin.com
nyaa.si
sukebei.nyaa.si
twitter.com
Also anyone know how to descramble these images?
Rectangular selection and a free afternoon.
But seriously, that was a topic of discussion in one of the recent threads and from what I gathered, you need to look for the descrambling function in the reader and see what it does.
How do I do that lol
It should look like this.
What series?
I am still waiting for somebody to scanlate this series since my nickname is "Zeus": mangaupdates.com
I've lost hope though...
>bizarre looking C-tier manga
yeah, no chance.
It looks bizarre, but also kind of fun in my opinion. I'm sure I'd read it.
Or maybe somebody could give me advice on how I could translate it for myself if I ever find raws of it.
Looking for manga to translate for no real reason.
what is this?
you could start by learning moon
Try Zeusu no tane please!
What is this? I'm pretty new to manga, sorry about that.
You could try Marimari Marigold.
I want to see an entire manga written in Touhokuben and/or Izumoben
does there yet exist such a thing
>you need to look for the descrambling function in the reader
Harder than it sounds, considering all those readers have been minified and pretty much obfuscated. I tried to look for the descrambling/loading function in the jump+ reader and I just got lost.
>How's it going?
Too damn lazy to start doing anything.
Anybody still using Share? Damn, I miss those scanned raws.
Well good thing that isn't the case for the one he linked
Just take a screenshot man
Would I save any money if I imported manga to someone in Tawian/Korea/HK to scan it there instead of shipping to yurop?
I knew I should have taken a JavaScript course
Instead I'm stuck in this Compiler hellhole in my college
He means the language
>I'm pretty new
ftfy
See this reader URL?
webcomicgamma.takeshobo.co.jp
Add data/0001.jpg for the first page:
webcomicgamma.takeshobo.co.jp
Add data/0001.ptimg.json to the reader URL to get the scrambling of the first page:
webcomicgamma.takeshobo.co.jp
Everything you need is in there, image dimensions, new and old coordinates to move the tiles. Just gotta write something up and you're done.
Rinse and repeat.
Typo, reader URL is this:
webcomicgamma.takeshobo.co.jp
But yeah, you should get the idea.
Probably will
That's confusing
This is probably the easiest I've ever encountered. If that's confusing I dunno how to help.
waiting for ripper user's reply
I've no idea what you mean by write something up
That means you open a text document (or IDE) and start writing an automated ripper in any programming language of your choice.
If you don't know any, you'll probably have to wait for someone to write something but that will depend on the mood of the anons in here.
>just write something looool
not a tech guy
You've got one hell of a jigsaw puzzle to solve, then.
Those coordinates confuse me
I'm more retarded than I thought
any tips on archiving stuff?
like with what image type to use, archive type, compression methods and the likes.
I don't mind bloating it up as long as it stays lossless and with least chance of getting corrupted.
I thought maybe it's possible to employ some heuristics to automatically detect descrambling functions or at least the moment when descrambling is complete and then just dump the detected canvas. Of course it wouldn't work on readers that divide page canvas into parts (only seen one site doing this) but it's a start I guess?
Or maybe just fuck it and use a headless browser and screenshot the fuck out of things
Seems easier to automatically detect the cutting grid and rearrange the parts.
This, and I'm not trying to be a fag, not much at least but, one thing is not be a complete moron when it comes to technology and other is expecting people to have specialized knowledge in something.
That would be possible if they use some scramlbing method that has overlapping parts like comicgum or the example in this thread.
But if they use a method which doesn't overlap, certainly not.
What if the page is almost all white? If there's a black dot inside a rectangle whose edges are all white and almost all other rectangles are completely white, how are you supposed to know where to place it?
You are fucked then. Add some simple UI to let user make decisions when the algorithm can't. If a human cannot say which part belongs where then it's fine either way.
Can somebody share the most recent ebookjapan ripping script?
inb4 nice try yahoo
You can easily monitor the canvas used for descrambling if you monkey patch canvas drawImage. However I think it would be pretty useless since you can't be sure which page is being currently descrambled.
it's better to intercept the assembled image further in the chain. Or rip the DOM like a barbarian if nothing else works.
Let's break down another scrambling function then. This one is a bit more complex, and a bit easier than the last one we did in one of these threads as it's not a mathematical formula for some easy cryptography technique. However, all this one is coordinates from the scrambled image superimposed onto a new image.
If we look at the JSON from it doesn't make that much sense. However, if we go and pretty print it, it gives us a nice little list. And from there, we can break down what's going on here.
The main item we'll be looking at is the coords array which has all the lines that start with "i:...". The two numerical values after the i are the coordinates in the scrambled image, so our first coordinate is (4, 4). That's 4 pixels down and 4 pixels right from the top-left corner. Then we have the width and height of what we're moving after the plus. So for the first value, it's 102 pixels wide and 150 pixels tall. Finally, we move this rectangle to the final coordinates in the descrambled image, (636, 0). Rinse and repeat for every array value. Someone can probably make a python script to do this in like 10 minutes.
I'm kinda into these DRM hacking lessons, can we make it a regular feature?
Ah, so the numbers after + were the width and height. I should've figured that out from how they were nearly the same in all of them.
I tried it on one image but I had to hard code the coordinates in because I couldn't get the JSON open for some reason
png, zipped
and then just use a backup solution that deduplicates, which means it should check hashes of blocks or the like like borgbackup
pastebin.com
Pretty simple implementation of what I said here, requires both the image file and the JSON, but it wouldn't be terribly difficult to grab both of those from a single URL.
I seem to do them every time I come into a thread, since "scanlation" is now How to Web Rip.
Treat the JSON as a Dict instead of a List. It's a pain in the ass to split the string though.
What do you all think of manga translated from Spanish to English?
better than nothing. it's not like your EOP readers can do anything about it.
You made a small error, in this case you can't simply use the image size of the input image (im.size) since it's different. You have to read it from the JSON.
This makes these two redundant:
>width = int(im.width)
>height = int(im.height)
(They would have been redundant in either case.)
And you simply change
>out = Image.new('RGB', im.size)
to
>out = Image.new('RGB', (data['views'][0]['width'], data['views'][0]['height']))
I honestly don't know why I browse these threads since I don't scanlate, but I am learning programming so this kind of stuff is very interesting.
Yeah I made the size mistake at first as well
>shitty horror manga
Of course it's scanlated by chillox
Does anyone know if rakuten has a subscription feature like ebj did? I always forget forget when a new release is due. I tried looking for one, but if it's there I'm too retarded to find it.
I was working off a previous script that the image size did matter, didn't exactly bother to make things super clean because fuck it, I'm doing this for free.
Look at automation with image manipulation, it gets fun.
pastebin.com
Updated the last script to rip entire chapters, just throw in the URL and page count. So for you'd do:
>python script.py webcomicgamma.takeshobo.co.jp
There are 31 pages total, that's why that's there.
Best way to install Ps?
Forgot image
You could have avoided the requirement of a manual max page input by reading the 'data-ptimg' div from the source.
I had the great idea last week of looking into decrypting Bookwalker's files and I'm at the end of my rope here. I have no idea how that user in the archive even got the database password out of it. Both loading the whole DecLib.dll in a program and creating only a function pointer to where I know the function I need begins throw two different kinds of memory access violations.
Debugging the Bookwalker app doesn't work either because it immediately spawns 3 separate processes that all seem to debug each other in a circle and kills itself so you can't attach your own debugger to any of them. I can see in the disassembled code how the app calls the dll functions, but I have no idea what black magic the devs incorporated to make it work. Process Manager also shows me the app doing some bizarre read accesses to the file after it's called. I don't suppose anyone here has looked into it before and has an idea of how to proceed?
I'm trying to gauge interest in a B00kwalker ripper, would people be interested in one that runs half on the client, half on the server (meaning no source code)? I've been thinking of making one for a while
>no source code
but why? do you want to make it a paid service, lol?
You do realize that any additional activity will put you straight into the crosshairs of the Japanese? It is best that you keep a low profile.
Is an offline solution out of the question?
No, I just don't want idiots ruining it for the rest of us. Let's imagine for a moment I post a BW ripper publicly. It would be a "secret" for less than a day, then it would spread, and then the BW devs would patch it, and then you would be fucked and all the time I've invested would go to waste.
This is not an exploit, this is for purchased items on your own account. And remember it is purely theoretical at this point.
For a public release, yes
even if it was server sided, they'd attack you like the dickens unless you somehow only ran it through tor.
even then, you'd get DDOS'd to absolute shit.
it'd be easier to attack you than to patch any exploits, by a country mile. You overestimate these sorts of businesses' tech turnover.
> running through tor
that's a given. Client program that starts a tor instance. And honestly, rate-limiting and countering ddos from the japs is really not that hard.
Well, if it works and can only download files that I already own to keep suspicion low, then I'm definitely interested. BW is the only site I've been able to use for raws and so far I've had to rely on automated stitching of screenshots from its app.
mangadex would like a word with you.
Hey I tried, they wouldn't listen. They have more important things to do like making fancy menus with anime girls
Japs aren't behind mangadex ddos
It rhymes with Missmanga
pastebin.com
Anything else you want? Since apparently I'm bending over backwards.
Sorry, I didn't mean to make you do it since I could have done it myself. Just wanted to point it out.
Does this work the same way for readers like the jump+ one?
Found something in my notes. Last password I have saved is ccs3319_bookwalker (likely changed since then). IIRC I used Rohitab API Monitor and logged all calls to wcslen function and saw it called with that string. It was called inside sqlcese40.dll.
You can use Process Hacker to force detach from the debugger, but I'm not sure if the app will run if you do that. The main process is .NET so you're better off using dnspy to decompile it once you dump it.
Keys inside the database are encrypted now, the decryption is likely performed by some native .dll.
Are there any caveats to just dumping the canvas for some of those readers? I've been steady ripping ebj with my little ghetto ripper, but the canvas size has gone down recently. I'm wondering if there's any funny business going on, like downscaling on the client, or if it's just some of the scans getting lower res.
EBJ doesn't have exclusive quality (higher resolution than other stores when it comes to Kodnasha) since February.
>just dumping the canvas
How did you get around the security errors?
Compare what you get with with scrambled images in the network panel.
What security errors?
Yeah, that's the previous database password. The new one is a much longer jumble of letters and symbols. I used JetBrains to decompile it and olly with scyllahide to get around its debugger detections, but I'll try those approaches. Maybe another decompiler will produce clearer code. The app comes with OpenSSL libraries, but it only uses those directly to generate private and public keys. The actual encryption and decryption is performed through DecLib.dll functions and the trail of encrypted keys that are used to decrypt keys to decrypt and deobfuscate the files is absurd now.
A ripper for that is a bit different. One was made in the last thread.
When external images are pasted onto a canvas, it gets marked as "dirty" and doesn't allow me to toDataUrl() or toBlob(). I could only get around that by modifying the site code to fetch the image with a cross-origin of "anonymous" and a CORS add-on.
You can modify the getter of HTMLImageElement's src property to proxy all links though something like cors-anywhere.herokuapp.com
*setter, of HTMLImageElement.prototype
Shit runs fine once you force detach debugger (didn't try attaching a debugger after that tho). You can even kill the processes, just leave the one that takes most RAM running, that's the real one. The database password: pastebin.com
Wow, that's an amazing hack. Thanks!
Maybe I should actually learn JS sometime instead of relying on StackOverflow and w3schools.
What kind of resources did you ripper anons go through before you knew enough to, well, dig through code/debuggers to make rippers?
What format do you like for your scripts? I've been just using Word, but I was wondering whether doing things in excel would make things easier for editors or not
Clear text files because it makes the editing easier and I use an automatic text placement script.
you will encounter non-windows users eventually
they will hate you
simple text would be the way to go anyway, since the formatting is all useless bullshit, and the one time i saw a script with arguments in notations i wanted to kill myself
Paint, I like to make them work for it.
>w3schools
Why would you use that instead of MDN?
It showed up first in Google? Idk what the best resources are, I'm only casually meddling in JS here and there.
what's stopping people from just programatically opening a browser window to a specific resolution and taking a screenshot
is it Windows? it's gotta be Windows, right?
well okay resolution -> window size
Depending on your aptitude, you could learn how to make a web ripper in a day for the easier pages. I barely know python or javascript, but I can still walk through them to understand them and then google anything I need to put together a solution.
You could do this very easily with Selenium automation. I assume the caveats would be resolution and fidelity.
What do you care?
>resolution
opening a window larger than your desktop resolution should be trivial.
>fidelity
the file should be reproduced exactly if it's at the correct resolution.
Good luck negotating with the viewers. Just rip the canvas like a sane human being.
a virtual mouse should work fine.
That works if the web viewer has a 1:1 zoom option, which Bookwalker doesn't have. Its 100% zoom level scales it to fit the window and it goes up in fixed intervals, meaning a page that's 1920 high will eventually be rendered at a resolution that's above 1920 high and if it's 1350 wide, it also won't fit onto a 1080p screen vertically at full height. Then there are releases at different sizes like 2048px or 1600px. One way or another it'll be rescaled from the original file and further degraded in quality.
scaling to fit the window no matter what is exactly what i was thinking of exploiting.
I just checked. You could probably rip losslessly from ebj that way since they put the original image in the canvas that scales up to its native resolution, but it would require manual cropping or at the very least manual setting of width and height for every page size. Still no idea why would you want to do it this way.
it would be more platform-agnostic, so you don't have to make rippers for every site as long as the comic is still viewable in the browser.
Like I said, you probably won't have so much luck with other viewers.
if the issue is finding the original resolution, that can be relatively manual, made easy by having a program incrementally scale the window.
being able to see the manga and being able to scale the image by changing the window size would be the only requirements for this to work.
meant for
The only thing easier than a .txt would be a .psd.
Oh, word? Are there any stores that get the good stuff still? I've searched around, but it's not too easy to tell what's what.
meant for excuse my haze
No, it's gone.
Yep, totally gone. Yahoo is intent on making our lives miserable
massive triggering everywhere so the usual
>Translator obviously have way more passion towards a series and already translated far ahead
>The cleaner/typesetter is a lazy fuck who takes age to finish a mere 10 pages chapter
Man I wish the translator can just find a new group already.
How do people get good at this shit?
Good at what, exactly?
Can you guys help a retard out, how do I use this?
What version of Python are you running? It works just fine on Python 3.7, but I suppose I could recode it use a different URL method.
So that is how I'm supposed to be doing it?
I have version 3.4.
Yeah, just tried it with your exact command and it worked fine. You can try this one where I imported more Python 2 stuff and changed the quotation marks for that one error.
pastebin.com
It's giving me the same error at line 23. I'll update it and try again then.
Which group is this?
me
Translator or cleaner/typesetter
latter
dealing with different motivation levels (and not everyone being a neet) is an important skill to have after a certain amount of time in the hobby
What series are you working on?
won't say
i will say, though, that torako apparently is TL'd to the end
>Torako
Good luck bud
>source + '/data/' + f'{c:04}' + '.jpg'
what the fug?
try f'{source}/data/{c:04}.jpg'
also what the fuck kind of variable name is c:04 is that even valid?
everyone in this thread needs to install miniconda and stop "having" versions of python. Make new conda environments every time you need some gay ass version of python for some homo use case and be done with it.
pastebin.com
Man, all the python people coming out to fix the script they didn't make. Makes optimization super easy for me though. c:04 is padding the number to 4 digits with leading zeroes since that's the naming scheme the website uses.
pastebin.com
Threw in regex string splitting before anyone gives me shit about not doing it via regex.
If this site uses the same reader as takeshobo, would it be possible to get the descrambled image if you used the script from this thread and just changed the url?
comip.jp
I used to have some sympathy for the slow typesetter, but then I gave it a shot myself and realized you can pump out a full volume every month easily even with a full time job and keeping up with anime/manga/vidya. There's literally no excuse for not being able to put out 10 measly pages a week and it pisses me off when some college kid acts like they're so busy and just can't possibly manage to commit to a schedule. If anyone's going to end up a NEET, it's those lazy fucks, not the ones with an actual work ethic.
The reader front end is the same, but the back end is not, so no, that script wouldn't work. I might go through the whole process again tomorrow for this site, but it'll take some large changes.
>you can pump out a full volume every month easily even with a full time job
I could do 1 tank/week, if I had scripts that is.
I'm going to give you shit anyways
Use a dict or namedtuple for huge parameter lists so people actually understand what's going in there instead of glazing over it.
Link related: pastebin.com
Get on my level, loser.
Since you all started using BeautifulSoup.
How about one of you adds os.mkdir(soup.title.string) and saves those fucking files to soup.title.string + os.sep + f'{c:04}.png'.
Since the site is shit you'll have to change the encoding to 'utf-8', though.
I've got pretty good redrawing skills but if I were to ever translate anything I'd put only slightly more effort than the minimum. I don't understand why some groups go through the effort of translating SFX.
this is python 3 so you would use pathlib instead
>Implying anyone who can't do this themselves will read it.
>implying you're not going to make the same homosexual mistakes in production code
I must be a total autist because everytime I see this thread I get excited about photoshop, and then realize it’s about scanlations instead of photoshop and I hide this gay shit.
I actually don't code for a living, and it shows. But when the cute guys and girls who can't do this much ask, it's nice to spoil them a bit.
tripfag go home
hey newbie here.
is there a program that screenshots manga from a reader easily?
also the mangas resolution is bigger than my screen.
thanks i just want to save stuff that i bought.
Lightshot
can someone translate or rip shirogane no nina chapters 105, 108-114, and 122
>have to debug bullshit "optimized" js code for my descrambler to work properly with 0 knowledge of js
I'll just stick with typesetting for now.
Why was it scrambled in the first place?
it's the universal icon for a marquee tool.
you ever open up an image editor before?
literally ms paint has this
to prevent people from easily ripping it.
What's the problem with easily being able to rip it?
I've been slowly uploading scans here.
nyaa.si
sukebei.nyaa.si
>all those great raws available
>everyone just scanlating garbage boys fantasy trash and yaoi
What a sad world we live in...
>those garbage
>great
haha
I only translate the ones that aren't obvious from context or if they make a joke work.
Thoughts on scanlating a manga that's getting animated anyway?
This is why the most ideal group size is 1. If you must, two with someone who will clean/typeset all in one and who can at least match raw release speed.
>Thoughts on scanlating a manga that's getting animated anyway?
NO READERS
O
R
E
A
D
E
R
S
too real
because you retards always start from chapter 1 instead of continuing from where the anime left off.
>those garbage
Dumb ESL.
>The plural form of garbage is also garbage.
american?
What is with groups that have such little QC care like this?
>Implying translating from some random middle chapter is good
Don't be a fucking retard.
Look up what an uncountable noun is, it should be in your English grammar book.
That almost happened to me a few times, but I also do everything by myself. Looking at the same pages a dozen times and quickly changing a typo makes you miss obvious things.
Maybe you should stop being an autistic faggot who cares too much about meaningless stuff instead.
Making fun of ESLs is a time-honored tradition.
>Implying translating from some random middle chapter is good
nobody cares. people want to see continuation from anime. people who scanlate niche manga that's already animated from chapter 1 ALWAYS (A L W A Y S) drop the series very very fast without even reaching double digit chapter.
Oh, I managed to catch a scanlation thread for once. Yay.
Probably the most basics of the basics of questions, but do you have a rule of the thumb when centering text for corner bubbles? I can't seem to get it right.
Rude, I managed to get to chapter 11
How can I redraw that speech bubble effect? I've tried to go over it with clone tool but it does not look good at all.
Ooh. The senryu redrawer.
Thank you for your work!
Just arrow-key mash and shift it with some quick fine pixel adjustment till it sits right. After a few dozen chapters you can just eyeball and tell what seems right. Both of those are fine enough but right is better IMO.
You can't really without stupid amounts of effort, you can do a more than fine enough job by putting the text over it first and just using a small 3-5px clone tool to copy the pattern from left/right though, and give the text a fat outline. No one will notice.
just make the japanese text white lmao
better to just go over the moon text with a large feathered white brush than to just make everything blurry like that
You've got enough material, so just clone the darker pattern all over the missing patch of hair and blot it with the lighter one, then use the lighter one again to redrawn the strands. Or vice versa. The text is gonna cover most of this, so you'll only need to actually redraw the most conspicuous parts.
EBJ finally forced the update, broke my ripping script. Gonna see if I can update it myself until someone who knows what they're doing posts a working version.
Scratch that, logging in seems to revert it to the old reader for now.
>scanlate manga
>gain popularity
>gets anime adaptation
>mfw people will compare my shit translations to someone else that is probably better than me.
This is my fear.
>pick up manga that's been dead for years
>already has anime adaptation
>notice that subs were dead for a while
>sub some episodes by literally copypastaing lines you translated onto the subs
>people like it
Best way to go, folks
oh I finally meet you, hello other senryu redrawer. It's me, redrawer number 11.
And yeah, basically what the other user said, clone left to right and skip any part that the eng text already covers. though you may just want to clone a circle from the from the top left of the bubble to the tippy top of that same bubble
btw how long did it take for you to do your part of 70.5. I'm asking because I only sat down to work on it the whole day after I foudn out you were done with your half.
Just looked at it real quick.
As the other user said, the back end is the same but the front end is a huge pain in the ass.
Getting the scrambled images and all is still quite easy (although still a pain if you want to automate it completely as not to visit the website at all).
And the JSON is hidden behind several JS layers so you need to go through the site step by step with the debugger and that's where the fun ends. If the other user actually does it, you lucked out. It's only a matter of finding the right JS functions and implementing them into his python code but that's what's painful.
The right one is better, but it would look even better if you move 'huh.' to it's own line.
Anybody tried PicPaw? I randomly got an email about it
Anyone knows where to find collectors to trade raws with? Are they on IRC or something? Or maybe I should look for Japanese people who scan? There are people who hoard a lot of rare porn and won't let it go so can I assume raw manga hoarders also exist?
>come into this thread to see what people are working on
>accidentally walk into /g/ instead
> redrawer number 11.
Ahaha. I'm happy you're still sticking with senryu.
Mmm. Right, noted with thanks. I guess it's an instinct thing after all.
Wait you also typeset!
Ganbare!
I don't see any arguments about meme languages here
It seems many scanlators are getting emails about it
Who is sending them? They got the emails from somewhere
Who's going?
Sounds like fun, meeting with a bunch of autist in person.
I had to check my Trash for the email. I bet MAL or some other popular service that got bought out sold the emails.
>MD is trying to do meetups now
Ew.
that's a good one user
now that i look at it again, that's a shit one user
actually, that's a meh one user
How good should you be before starting to translate porn? I'm reasonably confident that I can do simple oneshots, but I know I'm far from fluent and there may be times where I have to make a guess.
I don't mind being mediocre, but this world doesn't really need another shitter that clearly doesn't know either language.
>this world doesn't really need another shitter that clearly doesn't know either language
If incompetent fucktards weren't allowed to translate, we'd all be fapping to moon runes.
>porn
N4 is overqualified already desu
I translate porn semi-regularly and I've never studied nip in my life. The only thing people complain about is how ugly my typesetting and stuff is, never about the translation.
Do you just invent the lines or what?
No, I use a mix of what I've picked up by osmosis and a dictionary. Playing untranslated eroge has done wonders for my ability.
I thought I would never translate until I got real good. Instead I would volunteer as cleaner/typesetter/proofreader, and I kept working with "translators" who were fucking terrible. I would always end up fixing their dumb entry level mistakes. I decided that if those N5 goons were translating I could do it too.
If you're unsure you don't need to guess, you can always ask people on /djt/ or normalfag forums like 8lang and hinative. Just don't mention that it's for a manga translation.
you get real good by translating because it requires you to read and use dictionary to find out new vocabulary.
This gonna take forever to scan
But playing untranslated eroge is studying.
You can also google the word combination until you are fairly sure what it means.
alright, I guess I was worrying too much.
yeah, that solved most of my problems so far.