-
Notifications
You must be signed in to change notification settings - Fork 1
Open
Description
The package requires tabulate
to function, and despite it being mentioned in requirements.txt it does not get installed via pip install metawarc
.
I may address this with a pull, when I get some free time. Thanks for the package, it is helpful for testing code that reads and writes WARCs.
Install:
(venv) kev@ganymede:~/projs/coisas-deliciosas$ pip install metawarc
Collecting metawarc
Downloading metawarc-1.1.1.tar.gz (15 kB)
Installing build dependencies ... done
Getting requirements to build wheel ... done
Preparing metadata (pyproject.toml) ... done
Requirement already satisfied: warcio in ./venv/lib/python3.13/site-packages (from metawarc) (1.7.5)
Collecting pdfminer (from metawarc)
Downloading pdfminer-20191125.tar.gz (4.2 MB)
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 4.2/4.2 MB 3.3 MB/s eta 0:00:00
Installing build dependencies ... -
done
Getting requirements to build wheel ... done
Preparing metadata (pyproject.toml) ... done
Collecting click (from metawarc)
Using cached click-8.1.8-py3-none-any.whl.metadata (2.3 kB)
Collecting rich (from metawarc)
Using cached rich-13.9.4-py3-none-any.whl.metadata (18 kB)
Requirement already satisfied: lxml in ./venv/lib/python3.13/site-packages (from metawarc) (5.3.1)
Collecting hachoir (from metawarc)
Downloading hachoir-3.3.0-py3-none-any.whl.metadata (2.9 kB)
Collecting sqlalchemy (from metawarc)
Downloading SQLAlchemy-2.0.38-cp313-cp313-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (9.6 kB)
Collecting pycryptodome (from pdfminer->metawarc)
Downloading pycryptodome-3.21.0-cp36-abi3-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (3.4 kB)
Collecting markdown-it-py>=2.2.0 (from rich->metawarc)
Using cached markdown_it_py-3.0.0-py3-none-any.whl.metadata (6.9 kB)
Requirement already satisfied: pygments<3.0.0,>=2.13.0 in ./venv/lib/python3.13/site-packages (from rich->metawarc) (2.19.1)
Collecting greenlet!=0.4.17 (from sqlalchemy->metawarc)
Using cached greenlet-3.1.1-cp313-cp313-manylinux_2_24_x86_64.manylinux_2_28_x86_64.whl.metadata (3.8 kB)
Requirement already satisfied: typing-extensions>=4.6.0 in ./venv/lib/python3.13/site-packages (from sqlalchemy->metawarc) (4.12.2)
Requirement already satisfied: six in ./venv/lib/python3.13/site-packages (from warcio->metawarc) (1.17.0)
Collecting mdurl~=0.1 (from markdown-it-py>=2.2.0->rich->metawarc)
Using cached mdurl-0.1.2-py3-none-any.whl.metadata (1.6 kB)
Using cached click-8.1.8-py3-none-any.whl (98 kB)
Downloading hachoir-3.3.0-py3-none-any.whl (650 kB)
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 650.4/650.4 kB 2.8 MB/s eta 0:00:00
Using cached rich-13.9.4-py3-none-any.whl (242 kB)
Downloading SQLAlchemy-2.0.38-cp313-cp313-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (3.2 MB)
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 3.2/3.2 MB 3.1 MB/s eta 0:00:00
Using cached greenlet-3.1.1-cp313-cp313-manylinux_2_24_x86_64.manylinux_2_28_x86_64.whl (615 kB)
Using cached markdown_it_py-3.0.0-py3-none-any.whl (87 kB)
Downloading pycryptodome-3.21.0-cp36-abi3-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (2.3 MB)
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 2.3/2.3 MB 3.0 MB/s eta 0:00:00
Using cached mdurl-0.1.2-py3-none-any.whl (10.0 kB)
Building wheels for collected packages: metawarc, pdfminer
Building wheel for metawarc (pyproject.toml) ... done
Created wheel for metawarc: filename=metawarc-1.1.1-py2.py3-none-any.whl size=16703 sha256=99a787b6b0b0a80d5a8d13e665fa46625909136b9c46349bcf6942c304e3d1d5
Stored in directory: /home/kev/.cache/pip/wheels/f8/fa/9a/d931532c443a7627d7cf9fc340354c105930a19b033b7f38ce
Building wheel for pdfminer (pyproject.toml) ... done
Created wheel for pdfminer: filename=pdfminer-20191125-py3-none-any.whl size=6140774 sha256=2b79fafed2638b3f2e85b9c70a2b6a5e1bdb4cfa33b9c8dc526c7278c4144d0c
Stored in directory: /home/kev/.cache/pip/wheels/b6/45/85/3db811d181e6ac5b5ea9187c72cdd94ff1df2642d97cf3106a
Successfully built metawarc pdfminer
Installing collected packages: hachoir, pycryptodome, mdurl, greenlet, click, sqlalchemy, pdfminer, markdown-it-py, rich, metawarc
Successfully installed click-8.1.8 greenlet-3.1.1 hachoir-3.3.0 markdown-it-py-3.0.0 mdurl-0.1.2 metawarc-1.1.1 pdfminer-20191125 pycryptodome-3.21.0 rich-13.9.4 sqlalchemy-2.0.38
Run with error:
(venv) kev@ganymede:~/projs/coisas-deliciosas$ metawarc
Traceback (most recent call last):
File "/home/kev/projs/coisas-deliciosas/venv/bin/metawarc", line 8, in <module>
sys.exit(main())
~~~~^^
File "/home/kev/projs/coisas-deliciosas/venv/lib/python3.13/site-packages/metawarc/__main__.py", line 10, in main
from .core import cli
File "/home/kev/projs/coisas-deliciosas/venv/lib/python3.13/site-packages/metawarc/core.py", line 7, in <module>
from .cmds.analyzer import Analyzer
File "/home/kev/projs/coisas-deliciosas/venv/lib/python3.13/site-packages/metawarc/cmds/analyzer.py", line 3, in <module>
from tabulate import tabulate
ModuleNotFoundError: No module named 'tabulate'
Install missing tabulate
:
(venv) kev@ganymede:~/projs/coisas-deliciosas$ pip install tabulate
Collecting tabulate
Downloading tabulate-0.9.0-py3-none-any.whl.metadata (34 kB)
Downloading tabulate-0.9.0-py3-none-any.whl (35 kB)
Installing collected packages: tabulate
Successfully installed tabulate-0.9.0
Run successfully:
(venv) kev@ganymede:~/projs/coisas-deliciosas$ metawarc
Usage: metawarc [OPTIONS] COMMAND [ARGS]...
Options:
--help Show this message and exit.
Commands:
analyze Analysis of the WARC
dump Dumps content by query
export Exports WARC file headers or warcio index
index Generates WARC file index
list Lists urls inside WARC file
metadata Extracts metadata from files inside WARC file or another file...
stats Generates WARC file index
Metadata
Metadata
Assignees
Labels
No labels