Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

UnicodeEncodeError: 'gbk' codec can't encode character '\u2022' in position 1195: illegal multibyte sequence #198

Open
ArchieZhao opened this issue Dec 22, 2024 · 4 comments

Comments

@ArchieZhao
Copy link

Traceback (most recent call last):
File "D:\Program\AnacondaEnv\markitdown_env\lib\runpy.py", line 196, in _run_module_as_main
return run_code(code, main_globals, None,
File "D:\Program\AnacondaEnv\markitdown_env\lib\runpy.py", line 86, in run_code
exec(code, run_globals)
File "D:\Program\AnacondaEnv\markitdown_env\Scripts\markitdown.exe_main
.py", line 7, in
File "D:\Program\AnacondaEnv\markitdown_env\lib\site-packages\markitdown_main
.py", line 43, in main
print(result.text_content)
UnicodeEncodeError: 'gbk' codec can't encode character '\u2022' in position 1195: illegal multibyte sequence

@hanchan78
Copy link

Similar issues with me:
UnicodeEncodeError: 'charmap' codec can't encode character '\u2003' in position 3849: character maps to

On another document:
UnicodeEncodeError: 'charmap' codec can't encode character '\uf416' in position 6988: character maps to

@fishfen
Copy link

fishfen commented Dec 24, 2024

I encountered the same problem too.
UnicodeEncodeError: 'gbk' codec can't encode character '\u2217' in position 37: illegal multibyte sequence

@tonygeneral
Copy link

I encounter the same problem too meidi2023.pdf

@l-lumin
Copy link
Contributor

l-lumin commented Dec 26, 2024

I think it was fixed in PR #116

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

5 participants