Extract the readable content from the Web Page

#25

by gunjansa - opened 16 days ago

Hi, I am trying to use NuExtract to extract all the text content from a web page that is meant for the user to read.
So far I have been unable to create a suitable prompt, so any recommendations would be helpful. The templates I have tried are:

template = """{
"Page Title": "",
"Content": ""
}"""

template = """{
"Page Title": "",
"main_text": ""
}"""

template = """{
"title": "",
"author": "",
"published_date": "",
"article_paragraphs": []
}"""

template = """{
"title": "",
"author": "",
"published_date": "",
"full_article_text": "",
"language": ""
}"""
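
For reference, here is a minimal sketch of how one of these templates might be checked and combined with the page text before it goes to the model. The `build_prompt` helper and the `page_text` placeholder are hypothetical, and the `<|input|>` / `### Template:` / `### Text:` / `<|output|>` layout is an assumption that should be checked against the model card for the NuExtract version in use.

```python
import json

# One of the templates listed above.
template = """{
"title": "",
"author": "",
"published_date": "",
"article_paragraphs": []
}"""


def build_prompt(template: str, text: str) -> str:
    """Validate the template as JSON and wrap it together with the page text.

    The prompt layout below is an assumption; confirm it against the
    model card for the NuExtract version you are running.
    """
    # json.loads raises ValueError if the template is not valid JSON.
    schema = json.dumps(json.loads(template), indent=4)
    return f"<|input|>\n### Template:\n{schema}\n### Text:\n{text}\n<|output|>\n"


# Placeholder input: in practice this would be the visible text of the page,
# with HTML tags, scripts, and navigation already stripped.
page_text = "Example Page\nBy Jane Doe\nFirst paragraph of the article."

prompt = build_prompt(template, page_text)
print(prompt)
```

Parsing the template first catches malformed JSON before it reaches the model, which makes it easier to tell a template problem from a model problem.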
