Add greenmask for randomized database imports#1502
Conversation
| "venv": ["uv", "sync", "--all-extras", "--all-groups"], | ||
| "npm": ["npm", "install"] | ||
| "npm": ["npm", "install"], | ||
| "greenmask": "curl -fsSL https://greenmask.io/install.sh | sh -s -- -y -v v0.2.21" |
There was a problem hiding this comment.
Can this be put into the Dockerfile?
There was a problem hiding this comment.
It could, but it depends on features e.g. heroku and psql. Ugh, no it doesn't. What is the heuristic for what should be in the image vs somewhere else? If there were a greenmask feature I assume we'd use that like we do with others.
Yeah there's 2 options, either
We compute most of our counts with elasticsearch so this is unfortunately expected. |
|
Thanks @danlamanna for helping me resolve those issues. I can now use the placeholder or proxy mode for images, and running the |
15de9e2 to
c390129
Compare

This adds https://docs.greenmask.io/latest as a development utility which makes it easy to get a small subset of real, redacted, production data.
@annehaley can you give
dev/greenmask-dump.sha try locally and let me know how it works? You may want to run the server withexport DJANGO_ISIC_FAKE_STORAGE=placeholdersince there won't be any real images.