Category:
Clean HTML for LLM
Strip HTML tags and scripts to extract pure text for AI processing.
Clean Legal Contracts
Prepare legal documents for AI analysis. Redact parties and dates automatically.
Clean Resume Text
Clean up resume text for analysis. Remove weird PDF artifacts and bullets.
Clean Email Threads
Remove 'On [Date] wrote:' headers and signatures from email threads.
Clean Text for Llama 3
Prepare text for local Llama 3 inference. Remove formatting noise.
Remove Broken Line Breaks
Fix broken text with random line breaks from PDFs or websites.