@@ -83,10 +83,6 @@ Merges all `byty_*.json` files into `byty_merged.json` and generates `mapa_bytu.
**Deduplication logic:** Two listings are considered duplicates if they share the same normalized street name, price, and area. PSN and CityHome are loaded first, so their listings take priority and are kept over duplicates from other portals.
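
The rule above can be sketched roughly as follows. This is a minimal illustration, not the actual code in `merge_and_map.py`: the field names `street`, `price`, `area`, and `source`, the helper names, and the exact normalization (lowercasing, stripping diacritics, collapsing whitespace) are all assumptions made for the example.

```python
import unicodedata
from typing import Any, Dict, List, Tuple

Listing = Dict[str, Any]


def normalize_street(street: str) -> str:
    """Hypothetical normalization: strip diacritics, lowercase, collapse whitespace."""
    decomposed = unicodedata.normalize("NFKD", street)
    without_marks = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return " ".join(without_marks.lower().split())


def dedup_key(listing: Listing) -> Tuple[str, Any, Any]:
    """Build the (street, price, area) key used to detect duplicates."""
    return (
        normalize_street(listing.get("street", "")),
        listing.get("price"),
        listing.get("area"),
    )


def merge_listings(sources: List[List[Listing]]) -> List[Listing]:
    """Merge sources in order; the first listing seen for a key wins."""
    seen = set()
    merged = []
    for listings in sources:
        for listing in listings:
            key = dedup_key(listing)
            if key in seen:
                continue  # an earlier (higher-priority) source already has it
            seen.add(key)
            merged.append(listing)
    return merged


# Priority is expressed purely by load order: PSN and CityHome go first.
psn = [{"street": "Vinohradská", "price": 8500000, "area": 62, "source": "psn"}]
sreality = [
    {"street": "  vinohradska ", "price": 8500000, "area": 62, "source": "sreality"},
    {"street": "Korunní", "price": 7000000, "area": 50, "source": "sreality"},
]
merged = merge_listings([psn, sreality])
```

Because priority comes only from the order in which sources are passed in, changing which portal wins a tie would mean reordering the input list rather than touching the comparison itself.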
### `regen_map.py`
Regenerates the map from existing `byty_sreality.json` data without re-scraping. Fetches missing area values from the Sreality API, fixes URLs, and re-applies the area filter. Useful for tweaking map output after data has already been collected.
## Interactive map (`mapa_bytu.html`)
The generated map is a standalone HTML file using Leaflet.js with CARTO basemap tiles. Features:
@@ -201,7 +197,6 @@ Validation targets run scrapers with `--max-pages 1 --max-properties 10` for a f
├── scrape_psn.py # PSN scraper
├── scrape_cityhome.py # CityHome scraper
├── merge_and_map.py # Merge all sources + generate final map
├── regen_map.py # Regenerate map from cached Sreality data
├── run_all.sh # Orchestrator script (runs all scrapers + merge)
├── mapa_bytu.html # Generated interactive map (output)
├── Makefile # Docker management + validation shortcuts