Simon/romm - romm - Gitea: Git with a cup of tea

Simon/romm

mirror of https://github.com/rommapp/romm.git synced 2026-06-28 14:56:01 +00:00

Author	SHA1	Message	Date
Georges-Antoine Assi	0bfe369425	run fmt	2026-05-27 21:03:08 -04:00
Georges-Antoine Assi	acff688f11	refactor(hashing): use _make_file_hash helper at remaining sites Apply the helper to the three other per-file FileHash constructions (folder-walk hash, empty-archive fallback, single-file hash). The all-empty FileHash literals are left alone since the helper would be strictly more obscure for that case. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-27 09:12:11 -04:00
Georges-Antoine Assi	f255b5a7d9	feat(hashing): add RAR support to multi-file archive composite hashing Add read_rar_archive_files via the existing 7zz binary (which natively handles RAR3/RAR5 read), and collapse the per-extension reader dispatch into an ARCHIVE_READERS dict so future formats are one entry away. Also extract a small _make_file_hash helper to remove the repeated nested ternaries in the inner loop. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-27 09:09:37 -04:00
Georges-Antoine Assi	438c03facc	refactor(filesystem): extract archive/CHD helpers to utils/archives.py Pull file/archive readers (zip/tar/gz/bz2/7z), CHD parsing, and the shared libmagic MIME detector out of roms_handler.py into a new utils/archives.py. Rename the previously underscore-prefixed read_zip_archive_files / read_tar_archive_files to match the existing read_7z_archive_files convention, and consolidate the duplicated "with lock: detector.from_file()" pattern into a detect_mime_type helper. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-27 08:41:45 -04:00
Spinnich	242dc9e357	fix(hashing): use only default exclusions for archive internal files User-configured EXCLUDED_MULTI_PARTS_EXT/FILES are intentionally not applied to archive internal files. Archives are curated ROM sets where every file is relevant — user custom exclusions (e.g. "bin") could silently produce incorrect composite hashes. Only the hardcoded DEFAULT_EXCLUDED_FILES/EXTENSIONS (junk like .DS_Store, gamelist.xml) are applied. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-23 12:28:49 +00:00
Spinnich	a9f9ea2edc	fix(hashing): address trunk lint issues in composite archive hashing - Use AnyioPath.stat() instead of os.path.getmtime in async context (ASYNC240) - Add assert to narrow rom_md5_h/rom_sha1_h from HASH\|None to HASH (mypy/union-attr) - Auto-formatted long log.error calls in archive_7zip.py (ruff) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-23 12:14:39 +00:00
Spinnich	c20d48bbf8	feat(hashing): compute both composite hash & individual files hash for multi-file archives Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-23 12:04:04 +00:00
Georges-Antoine Assi	1be2ca2b3c	soimplify	2026-05-21 17:17:30 -04:00
copilot-swe-agent[bot]	98bc9a9eea	Optimize multi-ROM exclusion matching pass Co-authored-by: gantoine <3247106+gantoine@users.noreply.github.com>	2026-05-21 18:52:55 +00:00
copilot-swe-agent[bot]	5a1e238a5f	perf: pre-normalize exclusions once and use set for O(1) lookup in exclude_multi_roms Co-authored-by: gantoine <3247106+gantoine@users.noreply.github.com>	2026-05-21 18:50:45 +00:00
copilot-swe-agent[bot]	9e3f85b085	Fix ES-DE multi-folder exclusion matching Agent-Logs-Url: https://github.com/rommapp/romm/sessions/2213cb94-9971-48a6-8d17-9efc5c209db4 Co-authored-by: gantoine <3247106+gantoine@users.noreply.github.com>	2026-05-21 11:22:21 +00:00
Georges-Antoine Assi	94d011ee5e	tolerate launchbox basepath	2026-05-21 06:56:36 -04:00
Georges-Antoine Assi	405f678514	Merge pull request #3388 from rommapp/hardlink-resources-gamelist feat(fs): hardlink import/export assets, harden sync init	2026-05-19 09:04:18 -04:00
Georges-Antoine Assi	adb050f164	commit and push	2026-05-19 07:31:25 -04:00
Georges-Antoine Assi	f84796da08	Merge pull request #3385 from Spinnich/pr/chd-raw-hashing feat(hashing): compute raw CHD hashes and route disc-data SHA1 to Hasheous	2026-05-18 14:52:54 -04:00
Georges-Antoine Assi	591b07ec49	changes from bot review	2026-05-18 14:44:52 -04:00
Georges-Antoine Assi	e6d4ede939	cleanup	2026-05-18 07:40:59 -04:00
Georges-Antoine Assi	757fafae5f	feat(fs): hardlink import/export assets when possible, harden sync init Importer (gamelist/launchbox file:// flows) and exporters (gamelist.xml, metadata.pegasus.txt local exports) now hardlink media assets when source and destination share a filesystem, falling back transparently to a copy on EXDEV / EPERM / EOPNOTSUPP / EMLINK / EACCES (cross-device, FAT32, exFAT, network mounts, etc.). Saves disk space and is effectively instantaneous on large files (videos, manuals, miximages). Covers keep a real copy (allow_link=False) because _store_cover resizes the small cover in place via PIL.Image.save, which would truncate the shared inode and corrupt the user's source image. Also makes FSSyncHandler tolerate a missing/unwritable /romm/sync at startup: an OSError from mkdir now logs a warning instead of crashing the whole app at module-import time. Sync calls still fail at use time if the mount remains broken — the right place to surface the error. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-18 07:38:11 -04:00
Georges-Antoine Assi	90945685e4	Stuff	2026-05-17 12:43:33 -04:00
Spinnich	01f0b1d2b5	feat(hashing): compute raw CHD hashes and route disc-data SHA1 to Hasheous CHD files now follow the same hash logic as all other file types — CRC32, MD5, and SHA1 are computed from raw container bytes. This allows ScreenScraper to log KO entries for unrecognised CHD files, which it could not do when only the disc-data SHA1 was being computed. The CHD header SHA1 (disc-data SHA1) is separately extracted and stored in a new chd_sha1_hash field on RomFile, with a migration adding the column to rom_files. Hasheous receives only this disc-data SHA1 (no CRC/MD5) since it indexes disc-based games by disc-data SHA1, not raw file hashes. The RAHasher multi-file path now passes the largest CHD directly instead of a /* wildcard, which RAHasher cannot expand. Hash computations are wrapped in asyncio.to_thread to avoid blocking the event loop during large reads. Hash-lookup metadata handlers (ScreenScraper, Hasheous, Playmatch) now fall back to rom.files (stored DB hashes) when fs_rom files are not rehashed, fixing hash-based matching for UNMATCHED and UPDATE scan types. The Disc SHA-1 is displayed in the ROM detail view for both single-file (FileInfo.vue) and multi-file (FileSelectItem.vue) CHD games.	2026-05-17 08:01:05 -04:00
Georges-Antoine Assi	c6a2f56fad	Merge pull request #3367 from rommapp/regional-provider-tags Prefer ROM's own region tag for ScreenScraper and IGDB artwork	2026-05-13 11:19:53 -04:00
Georges-Antoine Assi	dad1250e15	case-insensitive region lookup for provider shortcode mapping Rom.regions can contain raw filename text like "europe" or "EUROPE" (filename parsing in roms_handler doesn't normalize casing), so the direct dict lookup missed those tags and the locale silently fell back to scan.priority.region. Replace the dict access with a helper that lowercases both sides. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-13 09:34:10 -04:00
Georges-Antoine Assi	944514acc0	prefer rom's own region tag for ScreenScraper and IGDB artwork When a ROM filename carries a region tag (e.g. (Europe)), use that region first when picking artwork and localized titles, falling back to the configured scan.priority.region. Previously the configured priority was the only signal, so a US-first config would force US covers onto European ROMs even when an EU asset was available. Adds a shared name->provider-shortcode map and threads the rom through the IGDB and SS lookup APIs so the rom-aware locale/region selection can run for both providers. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-13 09:06:11 -04:00
Georges-Antoine Assi	d8ef6f0c05	Merge branch 'master' into local-lb-fix	2026-05-09 13:20:31 -04:00
Georges-Antoine Assi	e3aaa106a2	perf(backend): reuse libmagic instance for image upload validation magic.Magic(mime=True) loads the magic database from disk on construction; instantiating it per request was adding pointless overhead to every avatar and artwork upload. Share a module-level instance guarded by a lock (the underlying magic_t handle is not thread-safe), and surface MagicException as a 400 so a sniffing failure fails closed instead of bubbling a 500. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-09 10:14:38 -04:00
Georges-Antoine Assi	53f14f5710	fix(backend): validate uploaded images with libmagic before storing Avatar, ROM artwork, and collection artwork uploads now sniff the file header with libmagic and reject anything that isn't PNG/JPEG/WebP/GIF, saving the file with an extension derived from the detected MIME rather than the user-supplied filename. Pairs with the raw asset endpoint, which decides inline vs attachment from the on-disk extension. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-09 09:18:02 -04:00
Georges-Antoine Assi	5e3a2707b0	cleanup	2026-05-03 19:39:19 -04:00
copilot-swe-agent[bot]	da005cf81a	Optimize fnmatch check and use consistent n64 filename in test Agent-Logs-Url: https://github.com/rommapp/romm/sessions/8cbbc2ca-a3e3-4c61-9e47-f8544d59231a Co-authored-by: gantoine <3247106+gantoine@users.noreply.github.com>	2026-05-03 23:36:23 +00:00
copilot-swe-agent[bot]	9593c30292	Address PR review: normalize exclusion sets, avoid duplicates, add multi-dot test for get_rom_files Agent-Logs-Url: https://github.com/rommapp/romm/sessions/8cbbc2ca-a3e3-4c61-9e47-f8544d59231a Co-authored-by: gantoine <3247106+gantoine@users.noreply.github.com>	2026-05-03 23:34:30 +00:00
copilot-swe-agent[bot]	101629628e	Simplify extension exclusion to use ends-with check instead of sub-extension iteration Agent-Logs-Url: https://github.com/rommapp/romm/sessions/a81b2023-a243-4721-bc5e-c6fa1a473a79 Co-authored-by: gantoine <3247106+gantoine@users.noreply.github.com>	2026-05-03 22:46:21 +00:00
copilot-swe-agent[bot]	55cd0cfc4f	Support compound suffix exclusions like "hash.txt" for multi-dot filenames Agent-Logs-Url: https://github.com/rommapp/romm/sessions/d1c69638-bfa0-480e-8050-d565b234ea44 Co-authored-by: gantoine <3247106+gantoine@users.noreply.github.com>	2026-05-03 01:29:04 +00:00
copilot-swe-agent[bot]	21de7e21f8	Fix file exclusion for multi-dot filenames (e.g. game.nds.hash.txt) Agent-Logs-Url: https://github.com/rommapp/romm/sessions/2f711770-100b-4e9e-a66e-ab1a74f025f8 Co-authored-by: gantoine <3247106+gantoine@users.noreply.github.com>	2026-05-02 16:50:30 +00:00
Georges-Antoine Assi	07af3d0f64	run fmt	2026-04-30 14:48:20 -04:00
Georges-Antoine Assi	962a9bfa7e	one more	2026-04-30 14:39:52 -04:00
Georges-Antoine Assi	fc8d69dc0c	update tests	2026-04-30 14:18:49 -04:00
Georges-Antoine Assi	96c3634b80	refactor: split HIGH_PRIO_STRUCTURE_PATH into STRUCTURE_PATH_A/B Replace the single HIGH_PRIO_STRUCTURE_PATH config attribute with two glob patterns (STRUCTURE_PATH_A = roms/, STRUCTURE_PATH_B = /roms) and update all call sites to detect Structure B via glob.glob, defaulting to Structure A when no match is found. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-30 12:52:22 -04:00
Georges-Antoine Assi	40689d7e39	Fix LaunchBox local-media file:// paths resolving under library root LaunchBox produced file:// URIs relative to /romm/launchbox, but the resources handler resolved them under /romm/library via fs_rom_handler, so local images/manuals/screenshots were never found. Switch LaunchBox to a distinct launchbox-file:// scheme and add FSLaunchboxHandler + _resolve_local_file_uri to route each scheme to the correct root. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-17 22:59:00 -04:00
Georges-Antoine Assi	e8a6e9f01d	final fixes	2026-04-12 18:43:24 -04:00
Georges-Antoine Assi	d45afb5dde	more fixes	2026-04-12 18:32:15 -04:00
Georges-Antoine Assi	628d8d8bae	refactor: pass RAGamesPlatform dict into calculate_hash, normalize extension Callers now pass the full platform dict and rom.fs_extension; the service normalizes the extension (optional leading dot, case-insensitive) before checking the compressed-archive skip set, so ROMs stored with bare extensions like "zip" correctly hit the skip path. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-12 18:05:44 -04:00
Georges-Antoine Assi	8f1b8f41d7	perf: skip RAHasher subprocess for archived disc-platform ROMs RAHasher was being spawned for every hashable ROM regardless of file type. When the source file is a zip/7z/tar and the RA platform needs an on-disk disc image (PSX, PS2, PSP, Saturn, Dreamcast, Sega CD, 3DO, PC-FX, Neo Geo CD, TurboGrafx CD, Atari Jaguar CD, Wii), the subprocess fails with "Unsupported console for buffer hash: {id}" after paying full process-spawn overhead per ROM — a serious slowdown when indexing large zipped collections (e.g. myrient PS2/PSP sets). calculate_hash now short-circuits those combinations with a debug log and no subprocess. Raw disc images (.iso, .chd, .cue/.bin) and archives on cartridge platforms still go through RAHasher as before. Also centralize COMPRESSED_FILE_EXTENSIONS in utils/filesystem.py so roms_handler (is_compressed_file / hashing), rahasher (skip logic), and feeds (PKGi passthrough) share one source of truth. The shared set adds .rar, which is_compressed_file now recognizes too. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-12 17:18:14 -04:00
Georges-Antoine Assi	f9f3dfd927	changes from bot review	2026-04-12 09:50:54 -04:00
Georges-Antoine Assi	6db9d45928	actually fix	2026-04-07 22:53:44 -04:00
Georges-Antoine Assi	5529fdad7b	Merge branch 'master' into romm-3232	2026-04-07 22:39:22 -04:00
Georges-Antoine Assi	f227a3145d	changes from bot review	2026-04-07 22:32:40 -04:00
Georges-Antoine Assi	6c88e098ba	[ROMM-3232] Fix content_hash not updated	2026-04-07 21:48:51 -04:00
Alex Vanderveen	6aca8fdfcf	Parse Content-Type essence before validating resource downloads Problem _check_content_type used the full Content-Type header string (lowercased) and matched it with startswith(...) against allowed prefixes. That is mostly fine when the server sends a bare type like application/pdf. It breaks down when vendors send parameters on the same header (e.g. name="…", charset=…). In theory application/force-download; name="…" should still start with application/force-download, but in practice you can get: Leading whitespace or a UTF‑8 BOM before the type token, so the string no longer starts with your prefix even though the MIME type is correct. Confusing logs: logging only the lowercased full header is fine, but the decision should be based on the standardized MIME essence (type + subtype, no parameters), which is what other stacks use for “what is this?” So the fix is to parse the header the usual way and only then apply your allowlist. What changed _content_type_essence(header_value) Takes everything before the first ; (the essence). Strips whitespace, lowercases, strips a leading BOM (\ufeff) so odd clients/proxies don’t break the check. _check_content_type Reads the raw content-type header once. Runs startswith on the essence, not on the full header with parameters. Rejects if the essence is empty (missing or useless header). Logging uses the raw header string (or (missing header)), so operators still see exactly what the server sent. Call sites and allowed prefixes (image/, application/pdf, etc.) are unchanged; only how the string is normalized before comparison changes. Security / SSRF This does not replace URL / SSRF controls; it only makes post-fetch type checking consistent with how Content-Type is defined (essence vs parameters). You are not widening the allowlist—same prefixes, stricter handling of “empty” and clearer matching on the actual type token. Risk / regression Low: same allowed prefixes, strictly more tolerant of benign formatting (whitespace, BOM, parameters). The only stricter case is empty essence after strip (e.g. malformed header), which correctly fails the check. \\\\\\\\\\\\\\\\\\\\\\\\\\\\\ I have reviewed the proposal and these edits will handle cases where the string we match against for the content_type is cleaned up more before comparing against the allow list of content_types. I have tested this, and confirm that I do not get any errors loading PDFs for game manuals using this. Please consider this, as this should be compatible with the existing content type allowlist, and easily work with any new types added to it.	2026-04-07 20:25:55 -04:00
Georges-Antoine Assi	b36a8b0cdf	manual types should support octet-stream and force-download response	2026-04-07 14:22:43 -04:00
Georges-Antoine Assi	2dc1678931	changes from bot review	2026-04-06 11:22:44 -04:00
Georges-Antoine Assi	f2619ac0d1	Merge branch 'master' into pegasus-metadata-export	2026-04-06 11:06:08 -04:00

1 2 3 4 5 ...

273 Commits