mealie-recipes / mealie

Mealie is a self hosted recipe manager and meal planner with a RestAPI backend and a reactive frontend application built in Vue for a pleasant user experience for the whole family. Easily add recipes into your database by providing the url and mealie will automatically import the relevant data or add a family recipe with the UI editor
https://docs.mealie.io
GNU Affero General Public License v3.0
5.59k stars 608 forks source link

[AllRecipes] - No ingredient are shown #3787

Closed zebus closed 4 days ago

zebus commented 4 days ago

First Check

Please provide 1-5 example URLs that are having errors

https://www.allrecipes.com/recipe/277009/easy-one-skillet-ground-beef-burrito/

Ingredients show up on the debug parser but for some reason don't make it into recipe when doing the actual import, verified with demo site and other discord user. Also parses correctly in tandoor which I believe uses same parser.

Please provide your logs for the Mealie container docker logs <container-id> > mealie.logs

Switching to dedicated user

        User uid:    1000
        User gid:    1000

INFO     2024-06-24T12:41:36 - Started server process [1]
INFO     2024-06-24T12:41:36 - Waiting for application startup.
INFO     2024-06-24T12:41:36 - start: database initialization
INFO     2024-06-24T12:41:36 - Database connection established.
INFO     2024-06-24T12:41:36 - Context impl PostgresqlImpl.
INFO     2024-06-24T12:41:36 - Will assume transactional DDL.
INFO     2024-06-24T12:41:36 - Migration needed. Performing migration...
INFO     2024-06-24T12:41:36 - Context impl PostgresqlImpl.
INFO     2024-06-24T12:41:36 - Will assume transactional DDL.
INFO     2024-06-24T12:41:36 - Running upgrade d7c6efd2de42 -> 7788478a0338, add group recipe actions
INFO     2024-06-24T12:41:36 - Checking for migration data fixes
INFO     2024-06-24T12:41:36 - end: database initialization
INFO     2024-06-24T12:41:36 - -----SYSTEM STARTUP-----
INFO     2024-06-24T12:41:36 - ------APP SETTINGS------
INFO     2024-06-24T12:41:36 - {
    "theme": {
        "light_primary": "#E58325",
        "light_accent": "#007A99",
        "light_secondary": "#973542",
        "light_success": "#43A047",
        "light_info": "#1976D2",
        "light_warning": "#FF6D00",
        "light_error": "#EF5350",
        "dark_primary": "#E58325",
        "dark_accent": "#007A99",
        "dark_secondary": "#973542",
        "dark_success": "#43A047",
        "dark_info": "#1976D2",
        "dark_warning": "#FF6D00",
        "dark_error": "#EF5350"
    },
    "PRODUCTION": true,
    "BASE_URL": "http://jurjurbox.lynx-hops.ts.net:6060",
    "STATIC_FILES": "/spa/static",
    "IS_DEMO": false,
    "HOST_IP": "*",
    "API_HOST": "0.0.0.0",
    "API_PORT": 9000,
    "API_DOCS": true,
    "TOKEN_TIME": 48,
    "LOG_CONFIG_OVERRIDE": null,
    "LOG_LEVEL": "info",
    "GIT_COMMIT_HASH": "d96c36333b9cb9461c5dee96ae28b60d912b38fd",
    "ALLOW_SIGNUP": true,
    "DAILY_SCHEDULE_TIME": "23:45",
    "SECURITY_MAX_LOGIN_ATTEMPTS": 5,
    "SECURITY_USER_LOCKOUT_TIME": 24,
    "DB_ENGINE": "postgres",
    "DEFAULT_GROUP": "Home",
    "SMTP_HOST": null,
    "SMTP_PORT": "587",
    "SMTP_FROM_NAME": "Mealie",
    "SMTP_FROM_EMAIL": null,
    "SMTP_AUTH_STRATEGY": "TLS",
    "LDAP_AUTH_ENABLED": false,
    "LDAP_SERVER_URL": null,
    "LDAP_TLS_INSECURE": false,
    "LDAP_TLS_CACERTFILE": null,
    "LDAP_ENABLE_STARTTLS": false,
    "LDAP_BASE_DN": null,
    "LDAP_QUERY_BIND": null,
    "LDAP_QUERY_PASSWORD": null,
    "LDAP_USER_FILTER": null,
    "LDAP_ADMIN_FILTER": null,
    "LDAP_ID_ATTRIBUTE": "uid",
    "LDAP_MAIL_ATTRIBUTE": "mail",
    "LDAP_NAME_ATTRIBUTE": "name",
    "OIDC_AUTH_ENABLED": false,
    "OIDC_CLIENT_ID": null,
    "OIDC_CONFIGURATION_URL": null,
    "OIDC_SIGNUP_ENABLED": true,
    "OIDC_USER_GROUP": null,
    "OIDC_ADMIN_GROUP": null,
    "OIDC_AUTO_REDIRECT": false,
    "OIDC_PROVIDER_NAME": "OAuth",
    "OIDC_REMEMBER_ME": false,
    "OIDC_SIGNING_ALGORITHM": "RS256",
    "OIDC_USER_CLAIM": "email",
    "OIDC_GROUPS_CLAIM": "groups",
    "OIDC_TLS_CACERTFILE": null,
    "OPENAI_BASE_URL": null,
    "OPENAI_API_KEY": null,
    "OPENAI_MODEL": "gpt-4o",
    "OPENAI_WORKERS": 2,
    "OPENAI_SEND_DATABASE_DATA": true,
    "TESTING": false
}
INFO     2024-06-24T12:41:36 - Daily tasks scheduled for 2024-06-24 23:45:00
INFO     2024-06-24T12:41:36 - Application startup complete.
INFO     2024-06-24T12:41:36 - Uvicorn running on http://0.0.0.0:9000 (Press CTRL+C to quit)
INFO     2024-06-24T12:41:36 - [100.97.149.26:57299] 200 OK "GET / HTTP/1.1"
INFO     2024-06-24T12:41:36 - [100.97.149.26:57299] 200 OK "GET /_nuxt/cad0bdc.js HTTP/1.1"
INFO     2024-06-24T12:41:36 - [100.97.149.26:57298] 200 OK "GET /_nuxt/c3a57f2.js HTTP/1.1"
INFO     2024-06-24T12:41:36 - [100.97.149.26:57300] 200 OK "GET /_nuxt/a72ba95.js HTTP/1.1"
INFO     2024-06-24T12:41:36 - [100.97.149.26:57301] 200 OK "GET /_nuxt/ed6c1c7.js HTTP/1.1"
INFO     2024-06-24T12:41:37 - [100.97.149.26:57300] 200 OK "GET /_nuxt/880ef18.js HTTP/1.1"
INFO     2024-06-24T12:41:37 - [100.97.149.26:57298] 200 OK "GET /_nuxt/d9d9b49.js HTTP/1.1"
INFO     2024-06-24T12:41:37 - [100.97.149.26:57298] 200 OK "GET /favicon.ico HTTP/1.1"
INFO     2024-06-24T12:41:37 - [100.97.149.26:57300] 200 OK "GET /icons/android-chrome-maskable-512x512.png HTTP/1.1"
INFO     2024-06-24T12:41:37 - [100.97.149.26:57300] 200 OK "GET /api/app/about/theme HTTP/1.1"
INFO     2024-06-24T12:41:38 - [100.97.149.26:57300] 401 Unauthorized "GET /api/users/self HTTP/1.1"
INFO     2024-06-24T12:41:38 - [100.97.149.26:57301] 200 OK "GET /_nuxt/fonts/Roboto-400-latin21.b009a76.woff2 HTTP/1.1"
INFO     2024-06-24T12:41:38 - [100.97.149.26:57300] 200 OK "GET /api/app/about HTTP/1.1"
INFO     2024-06-24T12:41:38 - [100.97.149.26:57298] 200 OK "GET /api/app/about HTTP/1.1"
INFO     2024-06-24T12:41:38 - [100.97.149.26:57300] 200 OK "GET /_nuxt/8423b77.js HTTP/1.1"
INFO     2024-06-24T12:41:38 - [100.97.149.26:57300] 200 OK "GET /icons/apple-touch-icon.png HTTP/1.1"
INFO     2024-06-24T12:41:38 - [100.97.149.26:57300] 200 OK "GET /_nuxt/fonts/Roboto-500-latin28.f25d774.woff2 HTTP/1.1"
INFO     2024-06-24T12:41:38 - [100.97.149.26:57298] 200 OK "GET /api/app/about/startup-info HTTP/1.1"
INFO     2024-06-24T12:41:38 - [100.97.149.26:57301] 200 OK "GET /api/app/about HTTP/1.1"
INFO     2024-06-24T12:41:42 - [100.97.149.26:57300] 200 OK "POST /api/auth/token HTTP/1.1"
INFO     2024-06-24T12:41:42 - [100.97.149.26:57300] 200 OK "GET /api/users/self HTTP/1.1"
INFO     2024-06-24T12:41:42 - [100.97.149.26:57300] 200 OK "GET /_nuxt/d968724.js HTTP/1.1"
INFO     2024-06-24T12:41:42 - [100.97.149.26:57301] 200 OK "GET /_nuxt/2c095ce.js HTTP/1.1"
INFO     2024-06-24T12:41:42 - [100.97.149.26:57298] 200 OK "GET /_nuxt/08b892e.js HTTP/1.1"
INFO     2024-06-24T12:41:42 - [100.97.149.26:57300] 200 OK "GET /_nuxt/905481a.js HTTP/1.1"
INFO     2024-06-24T12:41:42 - [100.97.149.26:57315] 200 OK "GET /_nuxt/5d8cce2.js HTTP/1.1"
INFO     2024-06-24T12:41:43 - [100.97.149.26:57315] 200 OK "GET /api/organizers/tags?page=1&perPage=-1&orderBy=name&orderDirection=asc HTTP/1.1"
INFO     2024-06-24T12:41:43 - [100.97.149.26:57316] 200 OK "GET /api/organizers/tools?page=1&perPage=-1&orderBy=name&orderDirection=asc HTTP/1.1"
INFO     2024-06-24T12:41:43 - [100.97.149.26:57300] 200 OK "GET /api/groups/cookbooks?page=1&perPage=-1&orderBy=position&orderDirection=asc HTTP/1.1"
INFO     2024-06-24T12:41:43 - [100.97.149.26:57298] 200 OK "GET /api/organizers/categories?page=1&perPage=-1&orderBy=name&orderDirection=asc HTTP/1.1"
INFO     2024-06-24T12:41:43 - [100.97.149.26:57317] 200 OK "GET /api/recipes?page=1&perPage=64&orderBy=created_at&orderDirection=desc&paginationSeed=1719250902780&searchSeed=1719250902780&search=&requireAllCategories=false&requireAllTags=false&requireAllTools=false&requireAllFoods=false HTTP/1.1"
INFO     2024-06-24T12:41:43 - [100.97.149.26:57315] 200 OK "GET /api/media/users/14d37274-995e-4497-97de-5352b56441ac/profile.webp?cacheKey=1234 HTTP/1.1"
INFO     2024-06-24T12:41:43 - [100.97.149.26:57301] 200 OK "GET /api/foods?page=1&perPage=-1&orderBy=name&orderDirection=asc HTTP/1.1"
INFO     2024-06-24T12:41:43 - [100.97.149.26:57315] 200 OK "GET /api/media/recipes/7252273b-5cff-4615-a901-55a548b7b42f/images/min-original.webp?rnd=1&version=kyiC HTTP/1.1"
INFO     2024-06-24T12:41:43 - [100.97.149.26:57300] 200 OK "GET /api/users/self/ratings HTTP/1.1"
INFO     2024-06-24T12:41:43 - [100.97.149.26:57301] 200 OK "GET /api/groups/recipe-actions?page=1&perPage=-1&orderBy=title&orderDirection=asc HTTP/1.1"
INFO     2024-06-24T12:41:43 - [100.97.149.26:57298] 200 OK "GET /api/groups/self HTTP/1.1"
INFO     2024-06-24T12:41:46 - [100.97.149.26:57301] 200 OK "GET /_nuxt/e010612.js HTTP/1.1"
INFO     2024-06-24T12:41:46 - [100.97.149.26:57317] 200 OK "GET /_nuxt/4fdbc8b.js HTTP/1.1"
INFO     2024-06-24T12:41:46 - [100.97.149.26:57298] 200 OK "GET /_nuxt/3efb961.js HTTP/1.1"
INFO     2024-06-24T12:41:46 - [100.97.149.26:57315] 200 OK "GET /_nuxt/707b470.js HTTP/1.1"
INFO     2024-06-24T12:41:46 - [100.97.149.26:57300] 200 OK "GET /_nuxt/59e7cbf.js HTTP/1.1"
INFO     2024-06-24T12:41:46 - [100.97.149.26:57300] 200 OK "GET /api/recipes/easy-one-skillet-ground-beef-burrito HTTP/1.1"
INFO     2024-06-24T12:41:46 - [100.97.149.26:57300] 200 OK "GET /api/media/recipes/7252273b-5cff-4615-a901-55a548b7b42f/images/original.webp?rnd=1&version=kyiC HTTP/1.1"
INFO     2024-06-24T12:41:46 - [100.97.149.26:57315] 200 OK "GET /_nuxt/fonts/Roboto-700-latin35.227c931.woff2 HTTP/1.1"
INFO     2024-06-24T12:41:46 - [100.97.149.26:57298] 200 OK "GET /api/users/self/ratings HTTP/1.1"
INFO     2024-06-24T12:41:46 - [100.97.149.26:57315] 200 OK "GET /_nuxt/assets/audio/kitchen_alarm.mp3 HTTP/1.1"
INFO     2024-06-24T12:41:46 - [100.97.149.26:57317] 200 OK "GET /api/recipes/easy-one-skillet-ground-beef-burrito/comments HTTP/1.1"
INFO     2024-06-24T12:41:46 - [100.97.149.26:57301] 200 OK "GET /api/organizers/tools?page=1&perPage=-1&orderBy=name&orderDirection=asc HTTP/1.1"
INFO     2024-06-24T12:42:01 - [127.0.0.1:33314] 200 OK "GET /api/app/about HTTP/1.1"
INFO     2024-06-24T12:42:05 - [100.97.149.26:57327] 200 OK "GET /api/organizers/tags?page=1&perPage=-1&orderBy=name&orderDirection=asc HTTP/1.1"
INFO     2024-06-24T12:42:05 - [100.97.149.26:57328] 200 OK "GET /api/organizers/tools?page=1&perPage=-1&orderBy=name&orderDirection=asc HTTP/1.1"
INFO     2024-06-24T12:42:05 - [100.97.149.26:57326] 200 OK "GET /api/organizers/categories?page=1&perPage=-1&orderBy=name&orderDirection=asc HTTP/1.1"
INFO     2024-06-24T12:42:05 - [100.97.149.26:57325] 200 OK "GET /api/units?page=1&perPage=-1&orderBy=name&orderDirection=asc HTTP/1.1"
INFO     2024-06-24T12:42:11 - [100.97.149.26:57330] 200 OK "GET /_nuxt/4653766.js HTTP/1.1"
INFO     2024-06-24T12:42:11 - [100.97.149.26:57329] 200 OK "GET /_nuxt/3d2578b.js HTTP/1.1"
INFO     2024-06-24T12:42:11 - [100.97.149.26:57331] 200 OK "GET /api/recipes/easy-one-skillet-ground-beef-burrito/comments HTTP/1.1"
INFO     2024-06-24T12:42:31 - [127.0.0.1:33142] 200 OK "GET /api/app/about HTTP/1.1"
INFO     2024-06-24T12:43:01 - [127.0.0.1:47518] 200 OK "GET /api/app/about HTTP/1.1"
INFO     2024-06-24T12:43:31 - [127.0.0.1:35014] 200 OK "GET /api/app/about HTTP/1.1"
INFO     2024-06-24T12:44:02 - [127.0.0.1:36992] 200 OK "GET /api/app/about HTTP/1.1"
INFO     2024-06-24T12:44:32 - [127.0.0.1:35072] 200 OK "GET /api/app/about HTTP/1.1"
INFO     2024-06-24T12:45:02 - [127.0.0.1:50818] 200 OK "GET /api/app/about HTTP/1.1"
INFO     2024-06-24T12:45:32 - [127.0.0.1:46660] 200 OK "GET /api/app/about HTTP/1.1"
INFO     2024-06-24T12:46:03 - [127.0.0.1:34738] 200 OK "GET /api/app/about HTTP/1.1"
INFO     2024-06-24T12:46:33 - [127.0.0.1:57922] 200 OK "GET /api/app/about HTTP/1.1"
INFO     2024-06-24T12:47:03 - [127.0.0.1:43238] 200 OK "GET /api/app/about HTTP/1.1"
INFO     2024-06-24T12:47:33 - [127.0.0.1:55208] 200 OK "GET /api/app/about HTTP/1.1"
INFO     2024-06-24T12:48:03 - [127.0.0.1:55202] 200 OK "GET /api/app/about HTTP/1.1"
INFO     2024-06-24T12:48:34 - [127.0.0.1:54640] 200 OK "GET /api/app/about HTTP/1.1"
INFO     2024-06-24T12:49:04 - [127.0.0.1:51472] 200 OK "GET /api/app/about HTTP/1.1"
INFO     2024-06-24T12:49:34 - [127.0.0.1:33450] 200 OK "GET /api/app/about HTTP/1.1"
INFO     2024-06-24T12:50:04 - [127.0.0.1:55074] 200 OK "GET /api/app/about HTTP/1.1"
INFO     2024-06-24T12:50:35 - [127.0.0.1:51794] 200 OK "GET /api/app/about HTTP/1.1"
INFO     2024-06-24T12:51:05 - [127.0.0.1:39982] 200 OK "GET /api/app/about HTTP/1.1"
INFO     2024-06-24T12:51:35 - [127.0.0.1:44400] 200 OK "GET /api/app/about HTTP/1.1"
INFO     2024-06-24T12:52:05 - [127.0.0.1:54722] 200 OK "GET /api/app/about HTTP/1.1"
INFO     2024-06-24T12:52:36 - [127.0.0.1:49380] 200 OK "GET /api/app/about HTTP/1.1"
INFO     2024-06-24T12:53:06 - [127.0.0.1:56412] 200 OK "GET /api/app/about HTTP/1.1"
INFO     2024-06-24T12:53:36 - [127.0.0.1:36984] 200 OK "GET /api/app/about HTTP/1.1"
INFO     2024-06-24T12:54:04 - [100.97.149.26:57605] 200 OK "GET /api/organizers/tags?page=1&perPage=-1&orderBy=name&orderDirection=asc HTTP/1.1"
INFO     2024-06-24T12:54:04 - [100.97.149.26:57606] 200 OK "GET /api/organizers/tools?page=1&perPage=-1&orderBy=name&orderDirection=asc HTTP/1.1"
INFO     2024-06-24T12:54:04 - [100.97.149.26:57604] 200 OK "GET /api/organizers/categories?page=1&perPage=-1&orderBy=name&orderDirection=asc HTTP/1.1"
INFO     2024-06-24T12:54:04 - [100.97.149.26:57607] 200 OK "GET /api/recipes?page=1&perPage=64&orderBy=created_at&orderDirection=desc&paginationSeed=1719251644061&searchSeed=1719251644061&search=&requireAllCategories=false&requireAllTags=false&requireAllTools=false&requireAllFoods=false HTTP/1.1"
INFO     2024-06-24T12:54:04 - [100.97.149.26:57607] 200 OK "GET /api/users/self/ratings HTTP/1.1"
INFO     2024-06-24T12:54:06 - [127.0.0.1:53106] 200 OK "GET /api/app/about HTTP/1.1"
INFO     2024-06-24T12:54:13 - [100.97.149.26:57611] 200 OK "GET /_nuxt/76fec0e.js HTTP/1.1"
INFO     2024-06-24T12:54:13 - [100.97.149.26:57610] 200 OK "GET /_nuxt/69cbe0d.js HTTP/1.1"
INFO     2024-06-24T12:54:14 - [100.97.149.26:57610] 200 OK "GET /_nuxt/img/recipes-create.ca6e04e.svg HTTP/1.1"
INFO     2024-06-24T12:54:14 - [100.97.149.26:57611] 200 OK "GET /api/organizers/tags?page=1&perPage=-1&orderBy=name&orderDirection=asc HTTP/1.1"
INFO     2024-06-24T12:54:16 - HTTP Request: GET https://www.allrecipes.com/recipe/277009/easy-one-skillet-ground-beef-burrito/ "HTTP/1.1 200 OK"
INFO     2024-06-24T12:54:16 - Image URL: ['https://www.allrecipes.com/thmb/hMehCKo_SsU_B23Kg2_RFp9Msb0=/1500x0/filters:no_upscale():max_bytes(150000):strip_icc()/Easy-One-Skillet-Ground-Beef-Burrito-c972b363dcfa4de4b97f2d782770a9c6.jpeg']
INFO     2024-06-24T12:54:16 - HTTP Request: HEAD https://www.allrecipes.com/thmb/hMehCKo_SsU_B23Kg2_RFp9Msb0=/1500x0/filters:no_upscale():max_bytes(150000):strip_icc()/Easy-One-Skillet-Ground-Beef-Burrito-c972b363dcfa4de4b97f2d782770a9c6.jpeg "HTTP/1.1 200 OK"
INFO     2024-06-24T12:54:16 - HTTP Request: GET https://www.allrecipes.com/thmb/hMehCKo_SsU_B23Kg2_RFp9Msb0=/1500x0/filters:no_upscale():max_bytes(150000):strip_icc()/Easy-One-Skillet-Ground-Beef-Burrito-c972b363dcfa4de4b97f2d782770a9c6.jpeg "HTTP/1.1 200 OK"
INFO     2024-06-24T12:54:16 - original.jpg minified
INFO     2024-06-24T12:54:16 - Tiny image saved
INFO     2024-06-24T12:54:16 - [100.97.149.26:57611] 201 Created "POST /api/recipes/create-url HTTP/1.1"
INFO     2024-06-24T12:54:17 - [100.97.149.26:57611] 200 OK "GET /api/recipes/easy-one-skillet-ground-beef-burrito-1 HTTP/1.1"
INFO     2024-06-24T12:54:17 - [100.97.149.26:57611] 200 OK "GET /api/media/recipes/4ccf7aa5-db2d-46de-9695-a1c8d745882b/images/original.webp?rnd=1&version=ezfD HTTP/1.1"
INFO     2024-06-24T12:54:17 - [100.97.149.26:57610] 200 OK "GET /api/users/self/ratings HTTP/1.1"
INFO     2024-06-24T12:54:17 - [100.97.149.26:57610] 200 OK "GET /_nuxt/assets/audio/kitchen_alarm.mp3 HTTP/1.1"
INFO     2024-06-24T12:54:17 - [100.97.149.26:57613] 200 OK "GET /api/recipes/easy-one-skillet-ground-beef-burrito-1/comments HTTP/1.1"
INFO     2024-06-24T12:54:17 - [100.97.149.26:57612] 200 OK "GET /api/organizers/tools?page=1&perPage=-1&orderBy=name&orderDirection=asc HTTP/1.1"
INFO     2024-06-24T12:54:37 - [127.0.0.1:55224] 200 OK "GET /api/app/about HTTP/1.1"

Deployment

Docker (Linux)

michael-genson commented 4 days ago

This is a defect in recipe scrapers. For some reason there's additional processing on recipe ingredients for allrecipes.com (my guess is that it used to be formatted strangely), but now it seems to be broken.

See this commit from December 2023: https://github.com/hhursev/recipe-scrapers/commit/ac101d7fad8b6cebe78b9588662761f2661f1101

Manually reverting this fixes the issue locally. I'll do some investigation and post this over on recipe scrapers.

michael-genson commented 4 days ago

FYI: https://github.com/hhursev/recipe-scrapers/issues/1154

Closing this issue since we'll automatically get the fix once it's updated over there