Poster: stbalbach
Date: June 21, 2010 09:03:27pm
Forum: texts
Subject: Re: user tpb uploads
>how does tpb get access to the Google server to put the PDF there, before linking it to IA?
It's done automatically with scripts; anyone can access Google Books. First the script creates a blank shell on IA, then uploads the book, processes it, etc. You can kinda follow the process by looking at the files of an example tpb book, along with the time stamps and contents of the XML files in the HTTP directory.
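If you'd rather not click through the directory by hand, here is a rough sketch of the same idea in Python. It is not how tpb's scripts work, only a way to line up an item's files by time stamp. It assumes IA serves item metadata as JSON at `https://archive.org/metadata/<identifier>`, with a `files` list whose entries carry `name` and `mtime` (epoch seconds); the function names are made up for illustration.

```python
import json
from datetime import datetime, timezone
from urllib.request import urlopen


def fetch_item_metadata(identifier):
    """Fetch an item's metadata as a dict.

    Assumption: the endpoint returns JSON containing a "files" list.
    """
    with urlopen(f"https://archive.org/metadata/{identifier}") as resp:
        return json.load(resp)


def file_timeline(metadata):
    """Return (UTC timestamp, file name) pairs, oldest first."""
    rows = []
    for entry in metadata.get("files", []):
        mtime = entry.get("mtime")
        if mtime is None:
            continue  # skip entries that carry no time stamp
        when = datetime.fromtimestamp(int(mtime), tz=timezone.utc)
        rows.append((when, entry.get("name", "")))
    return sorted(rows)
```

Calling `file_timeline(fetch_item_metadata(identifier))` with the identifier of any tpb item should give the order in which the files appeared, which is roughly the order of the upload and processing steps.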
> its a breach of Google terms of use to grab PDFs from its server, and upload to other servers, like IA.
I remember reading that IA has an agreement or understanding with Google to do it, and they are OK with it.
>Quite often clicking on a IA link for a book, you get directed to Google Books, and find the PDF is a pay for, and you only get to see one chapter or so.
The HTTP link would have a local copy on IA. There is also usually a copy at Hathi Trust.
>why are there not heaps of other IA volunteers doing the same swiping from Google, then uploading to the IA.
The books are in the public domain. I don't see why anyone couldn't do it, other than the time and effort. tpb is doing a good service; even if the quality of the books is poor, it's better than nothing.
>why is not all there inventory on the IA by now?
It's a few million books, which is a lot of disk space and bandwidth to consider. Plus I don't know how automated the process is; it may just take a long time.
>uploaded by tpb on behalf of Google
I don't think tpb works on behalf of Google. My recollection is tpb is connected to IA, not Google. There was a post about it on this forum a few years ago.
Poster: Time Traveller
Date: June 22, 2010 12:20:12am
Forum: texts
Subject: Re: user tpb uploads
That I did not know.
And does that not mean people can put malware onto Google Books?
I am close to collapse and have to go. Thanks for the information, and for not giving me a hard time. I have learnt some new things about Google.