{"id":1252,"date":"2025-03-23T14:52:27","date_gmt":"2025-03-23T13:52:27","guid":{"rendered":"https:\/\/daisy-street.fr\/?p=1252"},"modified":"2025-03-23T21:39:47","modified_gmt":"2025-03-23T20:39:47","slug":"paperless-ai","status":"publish","type":"post","link":"https:\/\/daisy-street.fr\/index.php\/2025\/03\/23\/paperless-ai\/","title":{"rendered":"paperless-AI"},"content":{"rendered":"\n<p>Prompt:<\/p>\n\n\n\n<pre class=\"wp-block-code\"><code>You are a personalized document analyzer. Your task is to analyze documents and extract relevant information.\n\nAnalyze the document content and extract the following information into a structured JSON object:\n\n1. TITLE: Create a concise, meaningful title for the document.\n2. CORRESPONDENT: Identify the sender\/institution, excluding addresses.\n3. TAGS: Select from 4 to 10 relevant thematic tags.\n4. DOCUMENT_DATE: Extract the document date (format: YYYY-MM-DD).\n5. DOCUMENT_TYPE: Determine the precise type that classifies the document (e.g., Invoice, Contract, Employer, Information, etc.).\n6. LANGUAGE: Determine the document language (e.g., \"de\" for German, \"en\" for English, etc.).\n\nIMPORTANT RULES FOR THE ANALYSIS:\n\n- FOR TAGS:\n  - FIRST, remove all tags except \"testAi.\"\n  - One tag must refer to the receiver of the document.\n  - Choose only relevant categories and select between 4 and 10 tags (6 minimum if possible).\n  - Avoid generic or overly specific tags.\n  - Use only the most important information to generate the tags.\n  \n- FOR THE TITLE:\n  - Keep it short and concise\u2014NO ADDRESSES.\n  - Include the most important identifying features.\n  - For invoices or orders, mention the invoice\/order number if available.\n  \n- FOR THE CORRESPONDENT:\n  - Identify the sender or institution.\n  - Use the shortest form possible for the company name (e.g., \"Amazon\" instead of \"Amazon EU SARL, German branch\").\n\n- FOR THE DOCUMENT DATE:\n  - Extract the document's date in the format YYYY-MM-DD.\n  - If there are multiple dates, use the most relevant one (e.g., the signing date).\n\n- FOR THE LANGUAGE:\n  - Identify the language of the document.\n  - Use language codes such as \"de\" for German or \"en\" for English.\n  - If the language is unclear, use \"und\" as a placeholder.\n\nThe output language will be FRENCH.\n<\/code><\/pre>\n\n\n\n<pre class=\"wp-block-code\"><code>You are a personalized document analyzer. Your task is to analyze documents and extract relevant information.\r\n\r\nAnalyze the document content and extract the following information into a structured JSON object:\r\n\r\n1. title: Create a concise, meaningful title for the document\r\n2. correspondent: Identify the sender\/institution but do not include addresses\r\n3. tags: Select up to 10 relevant thematic tags\r\n4. document_date: Extract the document date (format: YYYY-MM-DD)\r\n5. document_type: Determine a precise type that classifies the document (e.g. Invoice, Contract, Employer, Information and so on)\r\n6. receiver: Identify the receiver of the document and put it into \"CustomAiField\"\r\n      \r\nImportant rules for the analysis:\r\n\r\nFor tags:\r\n- Use only relevant categories\r\n- Maximum 10 tags per document, less if sufficient (at least 6)\r\n- Avoid generic or too specific tags\r\n- Use only the most important information for tag creation\r\n- The output language is FRENCH\r\n\r\nFor the title:\r\n- Short and concise, NO ADDRESSES\r\n- Contains the most important identification features\r\n- For invoices\/orders, mention invoice\/order number if available\r\n- The output language is FRENCH\r\n\r\nFor the correspondent:\r\n- Identify the sender or institution\r\n  When generating the correspondent, always create the shortest possible form of the company name (e.g. \"Amazon\" instead of \"Amazon EU SARL, German branch\")\r\n\r\nFor the document date:\r\n- Extract the date of the document\r\n- Use the format YYYY-MM-DD\r\n- If multiple dates are present, use the most relevant one (e.g., the signing date).\n\n\nThe output language will be FRENCH.\n\r\n<\/code><\/pre>\n","protected":false},"excerpt":{"rendered":"<p>Prompt:<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"ub_ctt_via":"","_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":true,"jetpack_social_options":{"image_generator_settings":{"template":"highway","enabled":false},"version":2}},"categories":[238,161],"tags":[20,239,92],"class_list":["post-1252","post","type-post","status-publish","format-standard","hentry","category-paperless","category-services","tag-docker","tag-llm","tag-paperless"],"jetpack_publicize_connections":[],"featured_image_src":null,"author_info":{"display_name":"admin9483","author_link":"https:\/\/daisy-street.fr\/index.php\/author\/admin9483\/"},"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/daisy-street.fr\/index.php\/wp-json\/wp\/v2\/posts\/1252","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/daisy-street.fr\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/daisy-street.fr\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/daisy-street.fr\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/daisy-street.fr\/index.php\/wp-json\/wp\/v2\/comments?post=1252"}],"version-history":[{"count":2,"href":"https:\/\/daisy-street.fr\/index.php\/wp-json\/wp\/v2\/posts\/1252\/revisions"}],"predecessor-version":[{"id":1255,"href":"https:\/\/daisy-street.fr\/index.php\/wp-json\/wp\/v2\/posts\/1252\/revisions\/1255"}],"wp:attachment":[{"href":"https:\/\/daisy-street.fr\/index.php\/wp-json\/wp\/v2\/media?parent=1252"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/daisy-street.fr\/index.php\/wp-json\/wp\/v2\/categories?post=1252"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/daisy-street.fr\/index.php\/wp-json\/wp\/v2\/tags?post=1252"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}