Microsoft opens sources Python tool for converting files and office documents to Markdown
MarkItDown is a utility for converting various files to Markdown (e.g., for indexing, text analysis, etc). It supports:
  • PDF
  • PowerPoint
  • Word
  • Excel
  • Images (EXIF metadata and OCR)
  • Audio (EXIF metadata and speech transcription)
  • HTML
  • Text-based formats (CSV, JSON, XML)
  • ZIP files (iterates over contents)
4
2 comments
Marcio Pacheco
7
Microsoft opens sources Python tool for converting files and office documents to Markdown
Data Alchemy
skool.com/data-alchemy
Your Community to Master the Fundamentals of Working with Data and AI — by Datalumina®
Leaderboard (30-day)
Powered by