Commit 83b4f4a

minjk-bl committed
Apps > PDF - edit import package code, and defined function code
1 parent 882d71f, commit 83b4f4a

1 file changed, +6 -4 lines changed


src/common/pycode.js (6 additions, 4 deletions)
@@ -25,7 +25,8 @@ define ([
 
     const PDF_IMPORT = `import pandas as pd
 import fitz
-from nltk.tokenize import sent_tokenize`;
+import nltk
+nltk.download('punkt')`;
 
     const PDF_FUNC = `def vp_pdf_get_sentence(fname_lst):
     '''
@@ -43,14 +44,15 @@ from nltk.tokenize import sent_tokenize`;
                 text_lst = [block[4] for block in block_lst if block[6] == 0]
                 text = '\\n'.join(text_lst)
 
-                sentence_lst.extend([sentence for sentence in sent_tokenize(text)])
+                sentence_lst.extend([sentence for sentence in nltk.sent_tokenize(text)])
 
             doc.close()
-        except:
+        except Exception as e:
+            print(e)
             continue
 
         df_doc = pd.DataFrame({
-            'fname': fname,
+            'fname': fname.split('/')[-1],
             'sentence': sentence_lst
         })
         df = pd.concat([df,df_doc])
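
The two behavioural changes in the second hunk (reporting the exception before skipping a file, and storing only the file name rather than the full path) can be sketched in isolation. This is a minimal illustration, not the project's code: `short_name`, `collect`, and the `extract` callable are hypothetical names, and `extract` stands in for the PyMuPDF and nltk step so that the sketch runs with the standard library alone. Unlike the commit's `fname.split('/')[-1]`, the sketch also splits on backslashes, on the assumption that Windows-style paths may be passed in; on POSIX systems a backslash is a legal file name character, so that choice is not always right.

```python
import re


def short_name(fname):
    """Return the last path component, splitting on forward or back slashes."""
    return re.split(r"[\\/]", fname)[-1]


def collect(fname_lst, extract):
    """Call extract(fname) for each file; report and skip any file that fails.

    extract is a hypothetical stand-in for the PDF text extraction and
    sentence tokenizing step; it should return a list of sentences.
    """
    rows, errors = [], []
    for fname in fname_lst:
        try:
            sentences = extract(fname)
        except Exception as e:
            # Same policy as the commit: print the error, then move on.
            print(f"{fname}: {e}")
            errors.append(fname)
            continue
        # One (file name, sentence) pair per sentence, like the DataFrame rows.
        rows.extend((short_name(fname), s) for s in sentences)
    return rows, errors
```

Returning the list of failed files alongside the rows is an addition of the sketch; the committed code only prints the exception.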
