Conversation
For review purposes, don't worry about the math functions regarding splines. Reviewers can probably start around line 293.
```python
new_question = Question.objects.get(id=question.id)
new_question.id = None
new_question.group_id = None
new_question.save()
```
Let's replace this with `questions.services.common.clone_question`.
```python
for k, v in new_post.__dict__.items():
    if (
        k.startswith("_")
        or k == "id"
        or k == "group_of_questions_id"
        or k == "conditional_id"
    ):
        pass
    elif k == "question_id":
        post_dict[k] = new_question.id
    else:
        post_dict[k] = v
new_post = Post(**post_dict)
```
Let's move this to a separate `clone_post` function:

```python
def clone_post(post: Post, **kwargs):
    ...


clone_post(post, question=new_question)
```
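A hypothetical sketch of what such a helper could look like. The real `clone_post`/`clone_question` services aren't shown in this diff, so `EXCLUDED_FIELDS` and the keyword-override behavior here are assumptions inferred from the loop above; a real Django `Post` may also need special handling for relations that `__dict__` does not capture.

```python
# Hypothetical sketch only; field names come from the reviewed loop above.
EXCLUDED_FIELDS = {"id", "group_of_questions_id", "conditional_id"}


def clone_post(post, **overrides):
    # Copy every concrete field except internals (leading "_") and the
    # excluded identifiers, then apply caller-supplied overrides.
    field_values = {
        k: v
        for k, v in post.__dict__.items()
        if not k.startswith("_") and k not in EXCLUDED_FIELDS
    }
    field_values.update(overrides)
    return type(post)(**field_values)
```

Callers would then write something like `clone_post(post, question_id=new_question.id)` instead of building `post_dict` inline.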
```python
new_forecasts: list[Forecast] = []
for forecast in original_forecasts.iterator(chunk_size=100):
    forecast.id = None
    forecast.pk = None
    forecast.question = new_question
    forecast.post = new_post
    new_forecasts.append(forecast)
if new_forecasts:
    Forecast.objects.bulk_create(new_forecasts, batch_size=500)
```
The issue is that you keep the entire `new_forecasts` list in memory, which can eat a lot of RAM.
I already have a util for batched updates: `utils.models.ModelBatchUpdater`. Maybe you can create a similar helper, but for batched creation?
Something like:
```python
class ModelBatchCreator(ModelBatchUpdater):
    def __init__(
        self,
        model_class: type[DjangoModelType],
        batch_size: int = 100,
    ):
        self.model_class = model_class
        self.batch_size = batch_size
        self._batch: list[DjangoModelType] = []

    def append(self, obj: DjangoModelType) -> None:
        self._batch.append(obj)
        if len(self._batch) >= self.batch_size:
            self.flush()

    def flush(self) -> None:
        if self._batch:
            self.model_class.objects.bulk_create(self._batch)
            self._batch.clear()

    def __enter__(self):
        return self

    def __exit__(self, exc_type, exc_value, traceback):
        self.flush()
```

And then use it like:

```python
# Creating forecasts in bounded-size batches
with ModelBatchCreator(
    model_class=Forecast, batch_size=500
) as creator:
    for idx, forecast in enumerate(original_forecasts.iterator(chunk_size=500)):
        forecast.id = None
        forecast.pk = None
        forecast.question = new_question
        forecast.post = new_post
        creator.append(forecast)

        if idx % 500 == 0:
            logger.info(f"Created {idx}/{total} forecasts")
```

```python
question_to_change.scheduled_resolve_time = new_scheduled_close_time
question_to_change.save()
post = question_to_change.get_post()
assert post
```
`assert` statements are stripped when Python runs with the `-O` flag, so this check can silently disappear in production. Please replace it with an explicit exception raise.
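A minimal sketch of the replacement; the helper name, exception type, and message are illustrative, not taken from the codebase:

```python
def require_post(question):
    # Explicit check instead of `assert post`: this still runs under
    # `python -O`, where assert statements are compiled away.
    post = question.get_post()
    if post is None:
        raise ValueError(f"Question {question.id} has no associated post")
    return post
```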
```python
new_cdf = np.cumsum(new_pmf).tolist()[:-1]
return new_cdf
```
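For context on the snippet above: a running sum over a PMF produces the CDF, and `[:-1]` drops the final entry, which is the total mass (~1.0). A dependency-free sketch of the same transform using `itertools.accumulate`, which matches `np.cumsum` for 1-D lists:

```python
from itertools import accumulate


def pmf_to_cdf(pmf):
    # Running total of probability masses; drop the last value (the full
    # sum), mirroring np.cumsum(new_pmf).tolist()[:-1] in the diff.
    return list(accumulate(pmf))[:-1]
```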
```python
print("rescaling forecasts...")
```
Let's replace it with logging
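A sketch of what that might look like; the logger name, message format, and reporting interval are illustrative:

```python
import logging

logger = logging.getLogger(__name__)


def log_progress(done, total, every=100):
    # Periodic log records instead of print(..., end="\r"): log lines
    # survive in files and aggregators, unlike carriage-return output.
    if done % every == 0 or done == total:
        logger.info("Rescaled %d/%d forecasts", done, total)
```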
```python
for i, forecast in enumerate(forecasts.iterator(chunk_size=100), 1):
    print(i, "/", c, end="\r")
    forecast.continuous_cdf = transform_cdf(forecast.continuous_cdf)
    forecast.distribution_input = None
    updated_forecasts.append(forecast)
print()
print("Done")
if updated_forecasts:
    print("Saving forecasts...", end="\r")
    with transaction.atomic():
        Forecast.objects.bulk_update(
            updated_forecasts,
            fields=["continuous_cdf", "distribution_input"],
            batch_size=500,
        )
    print("Saving forecasts... DONE")
```
Same here -- let's use ModelBatchUpdater
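The `ModelBatchUpdater` util itself isn't shown in this diff, so as a hedged illustration of the pattern it presumably implements, here is a framework-free collector that flushes fixed-size batches through a callback (the real util would flush via `bulk_update` with `fields`):

```python
class BatchCollector:
    # Framework-free sketch of the batched-flush pattern assumed for
    # utils.models.ModelBatchUpdater; flush_fn stands in for bulk_update.
    def __init__(self, flush_fn, batch_size=100):
        self.flush_fn = flush_fn
        self.batch_size = batch_size
        self._batch = []

    def append(self, obj):
        self._batch.append(obj)
        if len(self._batch) >= self.batch_size:
            self.flush()

    def flush(self):
        if self._batch:
            self.flush_fn(list(self._batch))
            self._batch.clear()

    def __enter__(self):
        return self

    def __exit__(self, exc_type, exc_value, traceback):
        self.flush()
```

The point of the pattern: peak memory is bounded by `batch_size` instead of growing with the full queryset.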
```python
if question.type not in [
    Question.QuestionType.NUMERIC,
    Question.QuestionType.DISCRETE,
    Question.QuestionType.DATE,
]:
```
```python
if question.type not in QUESTION_CONTINUOUS_TYPES
```
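Assuming `QUESTION_CONTINUOUS_TYPES` is (or would be) a constant grouping the three continuous types, a stand-alone sketch; the enum values are placeholders, not the real `Question.QuestionType` definitions:

```python
from enum import Enum


class QuestionType(str, Enum):
    # Placeholder stand-in for Question.QuestionType.
    NUMERIC = "numeric"
    DISCRETE = "discrete"
    DATE = "date"
    BINARY = "binary"


QUESTION_CONTINUOUS_TYPES = (
    QuestionType.NUMERIC,
    QuestionType.DISCRETE,
    QuestionType.DATE,
)


def is_continuous(question_type):
    return question_type in QUESTION_CONTINUOUS_TYPES
```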
…reshape-continuous-command
closes #3707
adds command:
```shell
python manage.py reshape_continuous_question
```

params:
This command will be rarely (hopefully essentially never) used, and the code doesn't need to be polished. But it should be checked over for any logic faults.