Skip to content
Projects
Groups
Snippets
Help
Loading...
Help
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
P
pdf-miner
Project
Project
Details
Activity
Releases
Cycle Analytics
Repository
Repository
Files
Commits
Branches
Tags
Contributors
Graph
Compare
Charts
Issues
0
Issues
0
List
Board
Labels
Milestones
Merge Requests
0
Merge Requests
0
CI / CD
CI / CD
Pipelines
Jobs
Schedules
Charts
Wiki
Wiki
Snippets
Snippets
Members
Members
Collapse sidebar
Close sidebar
Activity
Graph
Charts
Create a new issue
Jobs
Commits
Issue Boards
Open sidebar
Qin Kaijie
pdf-miner
Commits
3400a661
Commit
3400a661
authored
Apr 15, 2024
by
quyuan
Browse files
Options
Browse Files
Download
Email Patches
Plain Diff
CI yaml
parent
90216330
Changes
3
Hide whitespace changes
Inline
Side-by-side
Showing
3 changed files
with
3 additions
and
4 deletions
+3
-4
benchmark.yml
.github/workflows/benchmark.yml
+2
-1
ocr_badcase.py
tools/ocr_badcase.py
+0
-3
text_badcase.py
tools/text_badcase.py
+1
-0
No files found.
.github/workflows/benchmark.yml
View file @
3400a661
...
@@ -44,7 +44,8 @@ jobs:
...
@@ -44,7 +44,8 @@ jobs:
-
name
:
benchmark
-
name
:
benchmark
run
:
|
run
:
|
echo "start test"
echo "start test"
cd tools && python ocr_badcase.py pdf_json_label_0306.json ocr_dataset.json json_files.zip badcase.json overall.json base_data.json
cd tools && python text_badcase.py pdf_json_label_0306.json pdf_json_label_0229.json json_files.zip text_badcase text_overall base_data_text.json --s3_bucket_name llm-process-pperf --s3_file_directory qa-validate/pdf-datasets/badcase --AWS_ACCESS_KEY 7X9CWNHIVOHH3LXRD5WK --AWS_SECRET_KEY IHLyTsv7h4ArzReLWUGZNKvwqB7CMrRi6e7ZyUt0 --END_POINT_URL http://p-ceph-norm-outside.pjlab.org.cn:80
cd tools && python ocr_badcase.py pdf_json_label_0306.json ocr_dataset.json json_files.zip ocr_badcase ocr_overall base_data_ocr.json --s3_bucket_name llm-process-pperf --s3_file_directory qa-validate/pdf-datasets/badcase --AWS_ACCESS_KEY 7X9CWNHIVOHH3LXRD5WK --AWS_SECRET_KEY IHLyTsv7h4ArzReLWUGZNKvwqB7CMrRi6e7ZyUt0 --END_POINT_URL http://p-ceph-norm-outside.pjlab.org.cn:80
notify_to_feishu
:
notify_to_feishu
:
if
:
${{ always() && !cancelled() && contains(needs.*.result, 'failure') && (github.ref_name == 'master') }}
if
:
${{ always() && !cancelled() && contains(needs.*.result, 'failure') && (github.ref_name == 'master') }}
needs
:
[
pdf-test
]
needs
:
[
pdf-test
]
...
...
tools/ocr_badcase.py
View file @
3400a661
...
@@ -867,7 +867,6 @@ def main(standard_file, test_file, zip_file, badcase_path, overall_path,base_dat
...
@@ -867,7 +867,6 @@ def main(standard_file, test_file, zip_file, badcase_path, overall_path,base_dat
save_results
(
result_dict
,
overall_report_dict
,
badcase_file
,
overall_file
)
save_results
(
result_dict
,
overall_report_dict
,
badcase_file
,
overall_file
)
result
=
compare_edit_distance
(
base_data_path
,
overall_report_dict
)
result
=
compare_edit_distance
(
base_data_path
,
overall_report_dict
)
<<<<<<<
HEAD
if
all
([
s3_bucket_name
,
s3_file_directory
,
aws_access_key
,
aws_secret_key
,
end_point_url
]):
if
all
([
s3_bucket_name
,
s3_file_directory
,
aws_access_key
,
aws_secret_key
,
end_point_url
]):
try
:
try
:
...
@@ -875,8 +874,6 @@ def main(standard_file, test_file, zip_file, badcase_path, overall_path,base_dat
...
@@ -875,8 +874,6 @@ def main(standard_file, test_file, zip_file, badcase_path, overall_path,base_dat
upload_to_s3
(
overall_file
,
s3_bucket_name
,
s3_file_directory
,
aws_access_key
,
aws_secret_key
,
end_point_url
)
upload_to_s3
(
overall_file
,
s3_bucket_name
,
s3_file_directory
,
aws_access_key
,
aws_secret_key
,
end_point_url
)
except
Exception
as
e
:
except
Exception
as
e
:
print
(
f
"上传到S3时发生错误: {e}"
)
print
(
f
"上传到S3时发生错误: {e}"
)
=======
>>>>>>>
ff8f62aa3c28facc192104387f131d87978064fc
print
(
result
)
print
(
result
)
assert
result
==
1
assert
result
==
1
...
...
tools/text_badcase.py
View file @
3400a661
...
@@ -886,6 +886,7 @@ def main(standard_file, test_file, zip_file, badcase_path, overall_path,base_dat
...
@@ -886,6 +886,7 @@ def main(standard_file, test_file, zip_file, badcase_path, overall_path,base_dat
except
Exception
as
e
:
except
Exception
as
e
:
print
(
f
"上传到S3时发生错误: {e}"
)
print
(
f
"上传到S3时发生错误: {e}"
)
print
(
result
)
print
(
result
)
assert
result
==
1
if
__name__
==
"__main__"
:
if
__name__
==
"__main__"
:
parser
=
argparse
.
ArgumentParser
(
description
=
"主函数,执行整个评估流程。"
)
parser
=
argparse
.
ArgumentParser
(
description
=
"主函数,执行整个评估流程。"
)
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment