2007年12月15日 星期六

古騰堡計畫作業四 - 文本編號

本學期的古騰堡計畫作業到此完畢,謝謝各位同學的參與,古騰堡計畫的同仁感謝各位同學的參與。

繳交內容
- 在 http://gutenberg.wikispaces.com/ 內,登錄取取得的文本編號及網址
- 繳交該網頁,並標示姓名及學號

古騰堡計畫作業到此完畢

繳交日期:
開放近用資訊 12/18 10:10
資訊組織 12/19 08:10
公共圖書館 12/20 10:10
網路社群建立與應用 12/20 15:40

說明:
古騰堡編輯大衛‧魏吉(David Widger)陸續核發文本編號,同時也駁回部份的申請案。

請依照古騰堡計畫 - 認證碼的建議,展讀大衛‧魏吉等編輯的電子郵件,與同學/學長討論,必要時,請攜帶所有資料印本,與我共同研究。

尚未取得認證碼的同學,加油!

2007年12月13日 星期四

古騰堡計畫 - 認證碼


回覆 Jessy 同學的信件
請參考,
多查字典,對於作業的進行,較有幫助


毛慶禎 <mao@lins.fju.edu.tw> 2007年12月14日 下午 1:15
收件者: jessy03162005@
Jessy.


謝謝您的來信,看得出來,您對學習抱持很大的熱誠,
請根據以下的原則進行,短期內必然有極為驚人的進展:

1. 找本英文字典,袖珍英漢辭典 / 謝國平, 三民書局,2006,274 元
2. 用翻譯網站,http://www.google.com/translate_t

把英文字檢查一遍,細讀其意義

根據您的來信及圖檔,回覆如下:

1. Clearance OK
完整的句子是:
Provide your Clearance OK key to upload a completed eBook:
運用翻譯網站,http://www.google.com/translate_t
譯為: 提供你關好關鍵上傳一個完成電子書
再用字典查詢:
Provide -> 提供
your -> 你的
Clearance OK key -> 認證碼,您已知道
to -> 去做一件事
upload -> 上傳
a completed -> 一本完整的
eBook -> electronic Books 的簡寫,即電子書
2. Status: Not OK
不要懷疑,就是 Not OK,可能的原因如下

3. Notes from the clearance team:
We can't clear a post-1923 text that was not published in the US, sorry.
If this is actually a 1929 reprint of a classical earlier item, please
resubmit your clearance request with the additional information.
這是對方給你的意見,請依照前述的方法,查詢字典及翻譯網站後,再向同學、學長請期,
每個星期二至星期四 07:00 - 15:30 我都在文開樓 823 研究室,除了上課時間外,歡迎來與我共同討論,一齊精進。


4. Title: 官場現形記guan chang xian xing ji
不能有中文,那些老外看不懂中文

這封信同時公告在 http://fjulins.blogspot.com/,給其他同學參考

在 2007/12/14,jessy03162005@撰寫:
>
> 老師你好 我是Jessy 修開放與近用資訊課程
>
> 夾帶檔案上面寫Clearance OK 指的是我已經取的認證碼
> 可是他又寄給我email內容(底下po的)是寫Not OK
> 所以我不太清楚我到底是有沒有取得認證碼?
>
> 謝謝^^
>
> Project Gutenberg copyright clearance submission
> automated status update notification.
>
> Title: 官場現形記guan chang xian xing ji
> Author1: li bao jia
> Status: Not OK
> Notes from the clearance team:
> We can't clear a post-1923 text that was not published in the US, sorry.
> If this is actually a 1929 reprint of a classical earlier item, please
> resubmit your clearance request with the additional information.
>
> Questions, comments or concerns? Please respond by email.
>
> Thanks for your work on this eBook!
> -- The Copyright Clearance Team (Juliet & Greg)
>
> ____________________________________________________________________________________
> 怎樣減少垃圾信?只要看到垃圾信,立即按下「這是垃圾信」按鈕,垃圾信剋星幫助你清除你信箱垃圾。
>


--
祝您步步高昇


毛慶禎, mao at lins.fju dot edu.tw,
輔仁大學圖書資訊學系副教授
(02) 29052334 (公) (02) 29017405 (傳真)

這封信可能含有私密性的資訊, 如果您不是預期的收信人, 請把它刪除並知會送件人。毛慶禎不能為未經授權的散布、複製或使用此信件負責。

古騰堡計畫 - 姓與名

大衛回覆Jocelyn的詢問
申請 認證碼時,在 Notes欄位裡,再寫一次作者/譯者等的姓(last name)與名(first name)


David Widger <cdwidger@> 2007年12月14日 上午 1:26
收件者: Jocelyn <jocelynj8648j@>
副本: mao@
Dear Wan Jei Chen,

Thank you for your email confirming the first and last names of the various persons in the data of these files.

For future contributions it will be a great help if you and your colleagues could place a listing of the first and last names in the NOTES section of the Project Gutenberg upload form exactly as you have done in your email below.

With kindest regards,

David

重新申請認證碼

發覺錯誤,請重新申請認證碼
並於notes 欄位說明,請求對方忽略(disregard)前次的request ID(你的是33463)
以此次的為準

2 封郵件
494100698 2007年12月13日 下午 8:09
回覆: 494100698
收件者: mao
毛老師:
我是修您開放近用資訊課程圖資三 王秀文,
今天我有重新向古騰堡提出申請,附上相關資料,
麻煩老師幫我看看有沒有問題並請古騰堡作業人員幫我審核是否通過
但我把作者的名和姓寫反了,這樣還可以嗎?
書名:恨海
作者:吳趼人(A.D.1866-A.D.1910)
書完成的年代:1906
附檔一:申請文本編號的尚等待審核網頁
附檔二:書名頁 26.5K
附檔三:版權頁 21.4K
附檔四:正文第一頁 55.3K
以上,麻煩老師,謝謝。

申請文本編號資料.JPG


檔案1.JPG


檔案2.JPG


檔案3.JPG



毛慶禎 <mao> 2007年12月14日 上午 10:52
收件者: 494100698
秀文,


為了配合對方的要求,
出版社名稱,請給漢語拚音
除了正文之外,全部給英文或漢語拚音

可以重新再送一次
並於notes 欄位說明,請求對方忽略(disregard)前次的request ID(你的是33463)
以此次的為準

您的信及此次回信,都將貼在 http://fjulins.blogspot.com/,
供其他同學參考

在 2007/12/13,494100698 撰寫:
[隱藏引用文字]
--
祝您步步高昇


毛慶禎, mao at lins.fju dot edu.tw,
輔仁大學圖書資訊學系副教授
(02) 29052334 (公) (02) 29017405 (傳真)

這封信可能含有私密性的資訊, 如果您不是預期的收信人, 請把它刪除並知會送件人。毛慶禎不能為未經授權的散布、複製或使用此信件負責。

2007年12月12日 星期三

古騰堡計畫注意事項 - 上傳檔案

根據古騰堡編輯大衛‧魏吉(David Widger)的來信,上傳的檔案還有一些亟待處理的事項取得認證碼後,表示該圖書被認證為公版著作。

內容不合於古騰堡的規定時,不能被接受,就沒有古騰堡文本編號

據大衛‧魏吉表示,以下的兩封信同時寄給相關的參與者;我建議以翻譯軟體譯為中文,並輔以字典,
http://www.google.com/translate_t
大衛‧魏吉歡迎參與者直接回信
由我代為轉達也是很好的管道

以下為此兩信的大意

--->

我,大衛‧魏吉, 是古騰堡計畫的編輯,我的工作是:

  1. 收取志工寄來的檔案
  2. 檢查文句的錯誤
  3. 加上前言、後語及後設資料
  4. 張貼

今天,收到克雷格‧紐比教授轉來27件中文公版著作檔案,稍晚又收到4件,並告知將持續收到更多的,已經把9件加1件貼在網路上,24小時內應該可以上線。

那些未符合古騰堡計畫要求的檔案,我還會找時間細看,並與這些志工直接溝通。

目前,有幾個議題,需要討論
  1. 請不要加入前言後語,我必須刪除它們;這個程序是自動的,不勞志工費心
  2. 部份志工不瞭解上傳檔案的表格,以致提供錯誤的資訊,例如,把譯者填入作者的欄位。
  3. 辨明姓與名。古騰堡計畫的規則是,先給名,再給姓。請這位同學,特別協助我,弄清楚作者的姓與名,Xu Jia-Fu <f780424@????>
  4. 我的編輯器可以讀取utf-8 或 utf-16,但不能看到Big-5;我們的張貼程序可以處理UTF-8,但不能處理UTF-16 及 Big-5。請給utf-8編碼的檔案。
  5. 一個作品祗能上傳一個檔案。
  6. 來自微軟文書處理軟體的 .rtf 或 .doc 格式,必須轉換為 UTF-8 編碼的 .txt 檔。
  7. 有些電腦的瀏覽器(IE 6.0或更舊),不能處理UTF-8,因此在上傳檔案的表格裡,填入的中文都不能被閱讀,請 fanghsuan hsu <poiuy0127@????> 協助我們。
以下的檔案已經處理好,給了古騰堡文本編號,24小時內,可以上線,請確認它們的作者姓及名:
若需更改,請mail告知。23814, 223816, 23817, 23818 已是有效的連結。

真是不幸,我的中文程度和各位的英文程度一樣糟,如克雷格‧紐比教授所述,我們必將克服此障礙,讓中文公版著作大量加入古騰堡計畫裡。

謝謝您幫我轉達上述的意思
請保持連繫

<-----


信件一
Submission of texts in Chinese to Project Gutenberg


David Widger <cdwidger@gmail.com>

2007年12月13日 上午 2:02

收件者: "Prof. Mao" <mao@lins.????>

副本: "Newby"" <"Greg>


Four additional files were received this morning,
one of which was satisfactory and has been posted
on the internet as etext #23827. Email has been
sent to the other contributors describing the problems.

In making the list of problems in my email of
yesterday I forgot to include a very important
one: legibility of the data placed in the PG
upload form. If the data such as the author's
name or the ebook Title is entered into the PG
upload form from a computer which is producing
characters other than of the ASCII, or
iso-8850-1, or UTF-8 then the note forwarded to
us to use in the posting of the file cannot be
read. The data is just non-meaningful
symbols. Without this data we cannot process the
file even though the UTF-8 text is fine in every other way.

This is an example of an upload note
demonstrating this problem which we of course had
to report to the contributor as unsatisfactory.

TITLE=ƒx¶YÕýÁx
EDITOR=¼Òñ˜ —î
CREDIT=http://ef.cdpa.nsysu.edu.tw/ccw/01/yli.htm
NOTIFY=fanghsuan hsu <poiuy0127@hotmail.com>
FILETYPE=other
LANGUAGE=Chinese
FILES=xY.zip


Whatever help you can give us in communicating
the problems I have outlined to your students will be much appreciated.

If you have questions regarding the PG Posting
process or if I have not made myself clear on any
of the points, kindly let me know.

With best regards,

David




信件二
Submission of texts in Chinese to Project Gutenberg


David Widger <cdwidger@gmail.com>

2007年12月12日 上午 7:43

收件者: "Prof. Mao" <mao@lins????>

副本: "Newby"" <"Greg>


Dear Professor Mao,

I am one of the Project Gutenberg Editors who receive files from
volunteers. I and my colleagues check these files for conformity to
PG standards, check them for textual errors (if they are in languages
we read), add the heading and trailing PG metadata and legalize that
you see at the beginning and end of each file--and post them to the internet.

Today I received 27 files in Chinese from volunteers whom Professor
Newby informs me are your students--he also indicates I should expect
many more of these files. Of the twenty-seven submissions there were
nine which, with a bit of work, I was able to post on the
internet. They will appear in the PG catalog in about 24 hours.

I have written email to several of the contributors describing for
their instruction various problems with the nine posted files which I
was able to adjust. As to the other 18 contributors I have not had
time to delve deeply enough into the problems but will do so in the
coming days and communicate directly with them.


There are several issues identified so far:

1. A PG header/trailer attached to the files by some of the
volunteers which needed to be deleted before I could start processing
the file (though sometimes there is more info in these as to author,
etc. than in the PG upload note). Our posting process automatically
adds the PG header and trailer so there is no need for the
contributor to try to add these to their files.

2. The volunteers in many cases seem not to be able to read or
understand the Project Gutenberg upload form thus inserting the wrong
information: for example the Translator's name in some cases entered
as the Author--plus other variations.

3. Determining the proper order for Chinese names. At times I found
the last name first, in English, French German and many other
languages it would be easy for me to correct but with no knowledge of
Chinese I cannot figure out which is which. In my feedback email to
four of the contributors today I have asked for guidelines in
this. The PG standard for the names of author, editor, translator,
illustrator are first name first and last name last. This standard is
necessary so our cataloguers get it right.

4. I can see in my unicode editor what looks like normal Chinese
characters in the files which are utf-8 or utf-16 but the Big-5
character set is beyond me. None of my editors can handle this
charset and I do not even know if any of the files submitted today
are in Big-5. Our automated posting program can handle UTF-8, but
not UTF-16 or Big-5. In those files posted today which came in as
UTF-16 I recoded them to UTF-8 using the recode.exe program before
starting the posting process. Unfortunately there are several files
in the 18 remaining unposted files which are labeled as UTF-8 which
are neither UTF-8 or UTF-16. I cannot determine what charset they
are and will have to ask the volunteer to resubmit them in valid
UTF-8 (or in UTF-16 if they are unable to accomplish the recoding to UTF-8).

5. Also there are several files in the unposted 18 which contain
several or many individual text files. PG can only accept one text
file of any charset for an eBook. That file gets the PG
filenumber. So sets with several text files must have these all
combined into one text file. Likewise with several html files, it is
best if there be just one file.

6. Several files were sent as MS Word files with the .rtf or .doc
extension. PG does include these formats but requires that when we do
so a .txt file (in the case of Chinese files UTF-8 format) also be included.


Please take a look at the following files I posted this morning. Some
or all of these may need to be reposted once I figure out which name
is the last name for PG cataloguing purposes:

http://www.gutenberg.org/2/3/8/1/23814

http://www.gutenberg.org/2/3/8/1/23816

http://www.gutenberg.org/2/3/8/1/23817

http://www.gutenberg.org/2/3/8/1/23818

http://www.gutenberg.org/2/3/8/1/23821

http://www.gutenberg.org/2/3/8/1/23822

http://www.gutenberg.org/2/3/8/1/23823

http://www.gutenberg.org/2/3/8/1/23824

http://www.gutenberg.org/2/3/8/1/23825


If corrections or changes are needed kindly send advice by email and
I will be glad to make the corrections.

I will greatly appreciate your help in instructing your students as
to the PG requirements above. In several of the email back and forth
between some of them it is obvious that they do not understand much
of what I write and of course I have equal difficulty understanding them.

With all this said Prof. Newby, Project Gutenberg and I are delighted
to see some eBooks in our catalog in the Chinese language. With your
help I believe we will be able to post many more.

With kindest regards,

David Widger
http://www.gutenberg.net.au/widger/home.html















I am one of the Project Gutenberg Editors and have received your
files for posting. Thank you very much for this contribution.

You have attached PG header and trailer to the file which is not
necessary as these are automatically produced during the posting
process. In future files there is no need for you to do this.

Not being familiar with Chinese names I am unsure if I have the
proper order of first and last names for the translators, editors,
and for you. Please see my list below and let me know if this is
correct--and if not please make the needed corrections. For example
in your upload note one of the translators is written hui neng shi
and in another place as Shi Hui-Neng, so you can understand my confusion.

Also I do not know who should be named as the author of this work. Is
the author known? If not we would normally put the author in the PG
catalog as Anonymous.

Again many thanks for submitting this file to PG.

David Widger



Uploaded at December 10, 2007 3:38 pm PST by Xu Jia-Fu <f780424@yahoo.com.tw>

TITLE=Liu Zu Tan Jing
SUBTITLE=Original publication year is A.D.713 at Tang Dynasty
AUTHOR=
TRANSLATOR=Shi Hui-Neng and Shi Fa-Hai
EDITOR=Shi Fa-Hai
CREDIT=Xu Jia-Fu
NOTIFY=Xu Jia-Fu <f780424@yahoo.com.tw>
FILETYPE=other
LANGUAGE=Chinese
FILES=33202-0.txt
SIZE=90223
CLEARED=OK 20071205045346shi liu zu tan jing hui neng shi 1992:p



2007年12月11日 星期二

古騰堡計畫 - 上傳內文

回覆同學的詢問,請大家參考:

> 當我們認證通過,要進行內文上傳時
> 1. 內文中要加上您提供的英文前後文嗎?

祗要給中文的內容,根據鍵入的其他資料,
由對方的程式自動產生英文的前後文

> 2.檔案形式的選擇,應該選擇哪一項?
> ASCII (7-bit)
> Latin-1 (ISO-8859-1)
> Other 8-bit
> Unicode
> HTML
> XML
> TeX/LaTeX
> PDF
> RTF
> Other/not listed

不知道你是怎麼做的,
把檔案拿來給我看,才會知道

我用 nvu 1.0 編輯,內定格式是 UNICODE - UTF8
請從這裡下載
http://moztw.org/nvu/

正確的認證碼申請內容

克雷格‧紐比(Greg Newby)建議,正確的認證申請內容,來信大意如下:

  1. 每個書影在 100-150K 之間,以 PNG、GIF、JPG 格式呈現
  2. 參閱附檔 china-1.png, china-2.png

Greg Newby <gbnewby@pglaf.org> 2007年12月12日 上午 3:15
回覆: gbnewby@pglaf.org
收件者: <mao@lins.????>, "Michael S. Hart" <hart@pobox.com>, John Guagliardo <john@gutenberg.cc>, 496100044@mail.?????????, <sony.li????@gmail.com>
Sorry for the further follow-up, and large attachment.
The attached PNG files shows some "ideal" information submitted.
The only problem is that the images are way too big: 100-150K per
image (PNG, GIF or JPG) is sufficient.

The information included for that request was very good.

We have started to receive eBook submissions. I will send
further information as they are evaluated.
-- Greg
[隱藏引用文字]
2 個附件

china-1.png
338K

china2.png
24K

2007年12月10日 星期一

認證碼回覆

根據古騰堡計畫工作人員克雷格‧紐比(Greg Newby)的回信,大意如下:

1. 已經取得認證碼(Clearance Key)者,請到此網頁
http://upload.pglaf.org,繼續上傳全文

2. 未取得認證碼者,請重新再提出認證申請

- 原始出版年,必須符合公版著作的規定(美國規定為 1923 年以前的出版品)
- 增加的注解、評論等,也需符合公版著作的規定
- 重印本必須沒有新的資料,才能符合公版著作的規定
- 作者往生於 1912 之前
- 呈交書名頁及版權頁的書影
- 書名及著者譯為英文,或漢語拚音


祝各位同學
作業愉快

Greg Newby <gbnewby@pglaf.org>


2007年12月11日 上午 5:59
回覆: gbnewby@pglaf.org
收件者: 毛慶禎
副 本: "Michael S. Hart" <hart@pobox.com>, John Guagliardo <john@gutenberg.cc>, 496100044@mail.XXXXx, 美君
On Sun, Dec 09, 2007 at 09:23:12AM +0800, ????????? wrote:
> Hi Michael and fellows,
>
>
> There are hundred Chinese eBooks sending to you by my students ask for
> copyright clearance during the last two weeks.
>
> Would you kindly let them pass your clearance as soon as possible?


Dear Prof. Mao:

I am working through these clearances, and have approved
many. For others, we cannot determine public domain status
based on the information provided.

For the items that are cleared, they can be submitted using
the clearance key at http://upload.pglaf.org

For the others, they will need to be resubmitted with further
details. We need to know:

- the original publication date, if prior to 1923
- whether any new materials were added (such as a new introduction)
- that the reprint doesn't have any other materials that might
qualify for a new reprint
- that the author died prior to 1912
- images of the title page & verso page (some clearance requests
did not include these)
- translation to English of the title and other relevant information
from the image of the title page & verso

Note that we cannot take items which were originally published
*after* 1923.

Thanks for your efforts. We're really looking forward
to getting these items into our collection.
-- Greg Newby

2007年12月9日 星期日

認證碼

經過一段時間的通溝
古騰堡計畫的經理 Greg Newby 非常熱心
將以最快的速度,參與我們的工作
請耐心等候
Newby 先生的回信,如下圖