Test latest sqsgenerator version by jan-janssen · Pull Request #470 · pyiron/structuretoolkit

jan-janssen · 2026-03-27T09:00:33Z

Summary by CodeRabbit

Chores
- Updated package dependencies to newer compatible versions.
Breaking Changes
- Restructured the structure generation function by removing several optional configuration parameters, simplifying its interface.
Tests
- Updated test suite to validate function signature and behavior changes.

for more information, see https://pre-commit.ci

coderabbitai · 2026-03-27T09:00:49Z

Warning

Rate limit exceeded

@jan-janssen has exceeded the limit for the number of commits that can be reviewed per hour. Please wait 12 minutes and 17 seconds before requesting another review.

Your organization is not enrolled in usage-based pricing. Contact your admin to enable usage-based pricing to continue reviews beyond the rate limit, or try again in 12 minutes and 17 seconds.

⌛ How to resolve this issue?

After the wait time has elapsed, a review can be triggered using the @coderabbitai review command as a PR comment. Alternatively, push new commits to this PR.

We recommend that you space out your commits to avoid hitting the rate limit.

🚦 How do rate limits work?

CodeRabbit enforces hourly rate limits for each developer per organization.

Our paid plans have higher rate limits than the trial, open-source and free plans. In all cases, we re-allow further reviews after a brief timeout.

Please see our FAQ for further information.

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 6ea69be1-637d-43e0-9e36-b25e37e2ee86

📥 Commits

Reviewing files that changed from the base of the PR and between b06fb57 and 58d82bf.

📒 Files selected for processing (5)

.ci_support/environment.yml
.github/workflows/pipeline.yml
pyproject.toml
src/structuretoolkit/build/sqs.py
tests/test_sqs.py

📝 Walkthrough

Walkthrough

Updated sqsgenerator dependencies across CI environments (from 0.2→0.4 and 0.3→0.5.4) and refactored the sqs_structures function to align with the new API, removing deprecated parameters and restructuring settings payload and result extraction logic.

Changes

Cohort / File(s)	Summary
CI Dependencies `.ci_support/environment-old.yml`, `.ci_support/environment.yml`	Updated `sqsgenerator` pinned versions to align with API changes: 0.2→0.4 and 0.3→0.5.4 respectively.
SQS Implementation `src/structuretoolkit/build/sqs.py`	Refactored `sqs_structures` to call new `sqsgenerator.optimize`/`to_ase` API; removed parameters (`prefactors`, `pair_weights`, `which`, `shell_distances`, `minimal`, `similar`); restructured settings payload with new keys (`iteration_mode`, serialized `structure`, `composition`, `thread_config`, `max_results_per_objective`, conditional `shell_weights`); rewrote result handling to iterate and build structures list via `to_ase`, extracting statistics from new result structure.
Test Updates `tests/test_sqs.py`	Removed deprecated parameters from `sqs_structures` calls; changed SRO validation from dictionary key checks to shape assertion `(5, 2, 2)`; updated `num_iterations` expectation to integer `1000000` and relaxed `cycle_time` threshold from `< 10` to `< 100000000000`.

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~20 minutes

Possibly related PRs

pyiron/structuretoolkit#218 — Originally added the tests/test_sqs.py test functions being modified here for API compatibility with the new sqsgenerator version.

Poem

🐰 Hoppity hoppity, structures align,
New sqsgenerator makes everything shine!
Parameters trimmed, payloads rearranged,
The SQS pipeline, beautifully changed. ✨

🚥 Pre-merge checks | ✅ 2 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 33.33% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (2 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title 'Test latest sqsgenerator version' directly describes the main change across the pull request: updating sqsgenerator dependencies and adjusting code to work with the new versions.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch sqs

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

codecov · 2026-03-27T09:02:32Z

Codecov Report

❌ Patch coverage is 94.11765% with 1 line in your changes missing coverage. Please review.
✅ Project coverage is 82.69%. Comparing base (f76f7f2) to head (58d82bf).
⚠️ Report is 5 commits behind head on main.

Files with missing lines	Patch %	Lines
src/structuretoolkit/build/sqs.py	94.11%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #470      +/-   ##
==========================================
- Coverage   82.85%   82.69%   -0.17%     
==========================================
  Files          25       25              
  Lines        1820     1832      +12     
==========================================
+ Hits         1508     1515       +7     
- Misses        312      317       +5

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

jan-janssen · 2026-03-27T09:03:33Z

@dgehringer I finally got around updating structuretoolkit to be compatible to the latest version of sqsgenerator. Maybe you can take a quick look at the update.

Copilot

Pull request overview

Updates structuretoolkit’s SQS generation integration and tests to work with a newer sqsgenerator release.

Changes:

Bumped sqsgenerator in CI conda environments to newer pinned versions.
Refactored sqs_structures to use the newer sqsgenerator.optimize / to_ase API and new settings schema.
Adjusted SQS unit tests to match new statistics/SRO outputs.

Reviewed changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 7 comments.

File	Description
`src/structuretoolkit/build/sqs.py`	Reworks SQS generation to call the new `sqsgenerator` API and changes how structures/SRO/timings are produced.
`tests/test_sqs.py`	Updates expectations and call signature for `sqs_structures` under the new `sqsgenerator` behavior.
`.ci_support/environment.yml`	Pins `sqsgenerator` to `0.5.4` in the main CI environment.
`.ci_support/environment-old.yml`	Pins `sqsgenerator` to `0.4` in the “old” CI environment.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-03-27T09:04:20Z

src/structuretoolkit/build/sqs.py

+    structures = []
+    finished = False
+    for r_lst in result:
+        if not finished:
+            for s in r_lst[1]:
+                structures.append(to_ase(s.structure()))
+                if len(structures) == output_structures:
+                    finished = True
+                    break


After finished becomes true, the outer for r_lst in result: loop continues iterating even though no more structures are collected. This can do unnecessary work (and potentially consume a large iterator); break out of the outer loop once the requested output_structures have been gathered.

src/structuretoolkit/build/sqs.py

Copilot · 2026-03-27T09:04:21Z

src/structuretoolkit/build/sqs.py

+                    break
+
+    sro_breakdown = [s.sro() for s in result[0][1]]
+    cycle_time = list(result.statistics.timings.values())[0]


cycle_time = list(result.statistics.timings.values())[0] is fragile: it will raise if timings is empty, and it picks an arbitrary timing entry based on dict insertion order rather than a well-defined metric (the previous implementation averaged timings). Consider selecting a specific timing key or aggregating deterministically (e.g., average or sum) so the returned cycle_time is stable and meaningful.

Suggested change

cycle_time = list(result.statistics.timings.values())[0]

timings = list(result.statistics.timings.values())

cycle_time = float(np.mean(timings)) if timings else 0.0

Copilot · 2026-03-27T09:04:21Z

tests/test_sqs.py

+        self.assertEqual(num_iterations, 1000000)
+        self.assertTrue(cycle_time < 100000000000)


self.assertTrue(cycle_time < 100000000000) effectively stops testing anything about performance/regressions (almost any value will pass). If runtime is too variable to bound tightly, consider asserting basic invariants (e.g., cycle_time >= 0 and finite) or mocking/stubbing timing rather than using an extremely large upper bound.

tests/test_sqs.py

Copilot · 2026-03-27T09:04:22Z

src/structuretoolkit/build/sqs.py

 def sqs_structures(
    structure: Atoms,
    mole_fractions: dict[str, float | int],
    weights: dict[int, float] | None = None,
    objective: float | np.ndarray = 0.0,
    iterations: float | int = 1e6,
    output_structures: int = 10,
    mode: str = "random",
    num_threads: int | None = None,
-    prefactors: float | np.ndarray | None = None,
-    pair_weights: np.ndarray | None = None,
    rtol: float | None = None,
    atol: float | None = None,
-    which: Iterable[int] | None = None,
-    shell_distances: Iterable[int] | None = None,
-    minimal: bool | None = True,
-    similar: bool | None = True,
    return_statistics: bool | None = False,
 ) -> Atoms | tuple[Atoms, dict[str, list], int, float]:


sqs_structures is part of the public API (re-exported from structuretoolkit.build and top-level structuretoolkit). Removing parameters like prefactors, pair_weights, which, etc. is a breaking change for downstream callers; consider keeping them as deprecated keyword args (ignored or mapped to the new sqsgenerator API) with a warning, or bumping the major version and documenting the break explicitly.

Copilot · 2026-03-27T09:04:22Z

src/structuretoolkit/build/sqs.py

@@ -188,54 +182,54 @@
        output_structures (int): The number of output structures.
        mode (str): The mode for selecting configurations.
        num_threads (Optional[int]): The number of threads to use.
-        prefactors (Optional[Union[float, np.ndarray]]): The prefactors for each shell.
-        pair_weights (Optional[np.ndarray]): The pair weights.
        rtol (Optional[float]): The relative tolerance.
        atol (Optional[float]): The absolute tolerance.
-        which (Optional[Iterable[int]]): The indices of the shells to optimize.
-        shell_distances (Optional[Iterable[int]]): The distances for each shell.
-        minimal (Optional[bool]): Whether to minimize the objective function.
-        similar (Optional[bool]): Whether to generate similar structures.
        return_statistics (Optional[bool]): Whether to return additional statistics.

    Returns:
        Union[Atoms, Tuple[Atoms, Dict[str, list], int, float]]: The generated structures or a tuple containing the structures, short-range order parameters breakdown, number of iterations, and average cycle time.



The return type annotation and docstring say this returns an Atoms (or tuple with Atoms), but the implementation returns a list[Atoms] (and sro_breakdown is no longer a dict[str, list]). Please update the type hints and docstring to match the actual return types so callers and static checkers don’t get misled.

coderabbitai

Actionable comments posted: 2

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@src/structuretoolkit/build/sqs.py`:
- Around line 219-231: The sro_breakdown is derived from result[0][1] regardless
of which SqsResult entries were actually appended to structures; change the loop
that builds structures (the block using structures, finished, result,
output_structures, and to_ase) to also collect the corresponding SqsResult
objects (e.g., selected_results) when you append a structure, and then compute
sro_breakdown from that same selection (sro_breakdown = [s.sro() for s in
selected_results]) instead of using result[0][1]; this ensures SRO data aligns
with the returned structures even when items are taken from multiple buckets or
truncated.

In `@tests/test_sqs.py`:
- Around line 63-66: Replace the brittle hard-coded stats assertions: instead of
asserting sro.shape == (5,2,2) and cycle_time < 100000000000, assert invariants
that don't depend on current shell count: check len(sro_breakdown) ==
len(structures_lst), for each sro assert sro.shape[1:] == (2, 2), and assert
cycle_time is a finite non-negative value (e.g., use math.isfinite(cycle_time)
and cycle_time >= 0); keep the num_iterations assertion if needed
(num_iterations == 1000000) or verify it against the test setup variable.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 30dd9f07-65d2-418e-a831-fcbd46b90908

📥 Commits

Reviewing files that changed from the base of the PR and between 5470cce and b06fb57.

📒 Files selected for processing (4)

.ci_support/environment-old.yml
.ci_support/environment.yml
src/structuretoolkit/build/sqs.py
tests/test_sqs.py

src/structuretoolkit/build/sqs.py

coderabbitai · 2026-03-27T09:13:54Z

tests/test_sqs.py

        for sro in sro_breakdown:
-            self.assertEqual(list(sro.keys()), ["Cu-Cu", "Cu-Au", "Au-Au"])
-        self.assertEqual(num_iterations, 1000000.0)
-        self.assertTrue(cycle_time < 10)
+            self.assertEqual((5, 2, 2), sro.shape)
+        self.assertEqual(num_iterations, 1000000)
+        self.assertTrue(cycle_time < 100000000000)


⚠️ Potential issue | 🟡 Minor

🧩 Analysis chain

🏁 Script executed:

# First, locate and examine the test file find . -name "test_sqs.py" -type f

Repository: pyiron/structuretoolkit

Length of output: 86

🏁 Script executed:

# Read the test file to understand the context cat -n ./tests/test_sqs.py | head -100

Repository: pyiron/structuretoolkit

Length of output: 2758

🏁 Script executed:

# Search for the sqs_structures function implementation rg -n "def sqs_structures" --type py

Repository: pyiron/structuretoolkit

Length of output: 124

🏁 Script executed:

# Read the sqs_structures function sed -n '160,300p' ./src/structuretoolkit/build/sqs.py

Repository: pyiron/structuretoolkit

Length of output: 3144

🏁 Script executed:

# Check if there's documentation or comments about sro shape rg -n "sro\(\)" --type py -B 3 -A 3

Repository: pyiron/structuretoolkit

Length of output: 555

🏁 Script executed:

# Look at the structure more carefully - check what result[0][1] vs all r_lst[1] contains sed -n '215,235p' ./src/structuretoolkit/build/sqs.py

Repository: pyiron/structuretoolkit

Length of output: 817

🏁 Script executed:

# Check the full context around how structures are populated vs sro_breakdown sed -n '217,235p' ./src/structuretoolkit/build/sqs.py

Repository: pyiron/structuretoolkit

Length of output: 668

🏁 Script executed:

# Look at how the structures loop works and what result[0][1] contains # Let me examine if there's a mismatch between returned structures and sro_breakdown sed -n '217,235p' ./src/structuretoolkit/build/sqs.py | cat -n

Repository: pyiron/structuretoolkit

Length of output: 801

🏁 Script executed:

# Let me understand what structures_lst returns and what sro_breakdown contains # by looking at the return statement more carefully sed -n '160,240p' ./src/structuretoolkit/build/sqs.py | grep -A 5 "sro_breakdown\|structures ="

Repository: pyiron/structuretoolkit

Length of output: 536

🏁 Script executed:

# Let me understand the potential mismatch: structures stops at output_structures but sro_breakdown uses all of result[0][1] # Let me check the test expectations more carefully sed -n '41,66p' ./tests/test_sqs.py

Repository: pyiron/structuretoolkit

Length of output: 1139

🏁 Script executed:

# Let me check if there's a potential mismatch between structures and sro_breakdown # by understanding the structure better. The key issue: does structures get limited to 10 # but sro_breakdown uses all of result[0][1]? # First, let's trace through: output_structures=10 means the loop breaks after 10 structures # But sro_breakdown = [s.sro() for s in result[0][1]] uses ALL of result[0]'s structures # This could be a mismatch if result[0] has more than 10 structures, or if structures # come from multiple objectives (result[0], result[1], etc.) # Let me search for sqsgenerator documentation or usage hints rg -n "result\[0\]" --type py -B 2 -A 2

Repository: pyiron/structuretoolkit

Length of output: 442

🏁 Script executed:

# Let me verify if there's a potential mismatch by looking at the loop logic more carefully # The key observation: structures breaks after 10, but sro_breakdown uses result[0][1] completely sed -n '217,235p' ./src/structuretoolkit/build/sqs.py

Repository: pyiron/structuretoolkit

Length of output: 668

🏁 Script executed:

# Let me check the actual structure of what sqsgenerator returns # by looking for any related imports or type hints rg -n "from sqsgenerator import\|import sqsgenerator" --type py -A 5

Repository: pyiron/structuretoolkit

Length of output: 49

🏁 Script executed:

# Let me understand the potential mismatch between structures_lst and sro_breakdown # The key question: does result[0][1] always have exactly output_structures items? # Or can it have more/fewer? # Let me trace the logic: # - structures stops after output_structures (10) items are added # - sro_breakdown uses ALL of result[0][1] # These should match but might not # Let me search for any related issues or comments about this rg -n "len(structures_lst)" --type py -B 2 -A 2

Repository: pyiron/structuretoolkit

Length of output: 49

🏁 Script executed:

# Let me verify the critical mismatch: the structures loop stops at output_structures, # but sro_breakdown uses result[0][1] in its entirety. Are they the same length? # The key issue: # - structures breaks when len(structures) == output_structures (10) # - sro_breakdown = [s.sro() for s in result[0][1]] uses ALL of result[0][1] # # These might not match! Let me verify by checking the test assertions cat -n ./tests/test_sqs.py | sed -n '41,70p'

Repository: pyiron/structuretoolkit

Length of output: 1321

🏁 Script executed:

# Let me trace through the critical mismatch more carefully. # The structures loop breaks after 10 items, but sro_breakdown uses result[0][1] entirely. # Are they guaranteed to be the same length? # Let me check if there's any code that ensures they match sed -n '217,235p' ./src/structuretoolkit/build/sqs.py | python3 << 'EOF' import sys code = sys.stdin.read() print("=== Code Analysis ===") print("Loop: breaks after len(structures) == output_structures (10)") print("sro_breakdown: [s.sro() for s in result[0][1]]") print("\nPotential Mismatch:") print("- structures can have UP TO output_structures items from ANY objectives") print("- sro_breakdown uses ALL items from result[0][1] only") print("\nIf structures come from multiple objectives or result[0] has > 10:") print(" len(structures_lst) != len(sro_breakdown)") print("\nIf result[0] has exactly 10 and all structures come from result[0]:") print(" They match (coincidentally)") EOF

Repository: pyiron/structuretoolkit

Length of output: 554

These stats assertions are brittle and weak.

(5, 2, 2) hard-codes the current shell count even though shell selection is auto-derived when no shell settings are supplied, and cycle_time < 100000000000 is too loose to catch a regression in the new timing extraction. Prefer invariant checks here, e.g. len(sro_breakdown) == len(structures_lst), sro.shape[1:] == (2, 2), and a finite non-negative cycle_time.

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@tests/test_sqs.py` around lines 63 - 66, Replace the brittle hard-coded stats assertions: instead of asserting sro.shape == (5,2,2) and cycle_time < 100000000000, assert invariants that don't depend on current shell count: check len(sro_breakdown) == len(structures_lst), for each sro assert sro.shape[1:] == (2, 2), and assert cycle_time is a finite non-negative value (e.g., use math.isfinite(cycle_time) and cycle_time >= 0); keep the num_iterations assertion if needed (num_iterations == 1000000) or verify it against the test setup variable.

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

dgehringer · 2026-03-27T12:18:41Z

Hi @jan-janssen!

Thanks a lot for tackling this.

Yes for a basic interface that looks good to me

Test latest sqsgenerator version

ded1588

Copilot AI review requested due to automatic review settings March 27, 2026 09:00

[pre-commit.ci] auto fixes from pre-commit.com hooks

b06fb57

for more information, see https://pre-commit.ci

Copilot started reviewing on behalf of jan-janssen March 27, 2026 09:01 View session

Copilot AI reviewed Mar 27, 2026

View reviewed changes

jan-janssen added 3 commits March 27, 2026 10:05

update pymatgen to 2026.3.23

95681cb

Merge commit 'b06fb573749c5b6b4f07a82e8d3bb25d235a8f20' into sqs

2286474

Test with Python 3.14

6a41f18

coderabbitai bot reviewed Mar 27, 2026

View reviewed changes

jan-janssen and others added 2 commits March 27, 2026 10:15

Update src/structuretoolkit/build/sqs.py

f75ab23

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Update tests/test_sqs.py

58d82bf

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

jan-janssen merged commit 684c115 into main Mar 27, 2026
19 of 20 checks passed

jan-janssen deleted the sqs branch March 27, 2026 09:22

coderabbitai bot mentioned this pull request Apr 1, 2026

sqs improve interface #476

Open

	cycle_time = list(result.statistics.timings.values())[0]
	timings = list(result.statistics.timings.values())
	cycle_time = float(np.mean(timings)) if timings else 0.0

		self.assertEqual(num_iterations, 1000000)
		self.assertTrue(cycle_time < 100000000000)

Conversation

jan-janssen commented Mar 27, 2026 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary by CodeRabbit

Uh oh!

coderabbitai bot commented Mar 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Rate limit exceeded

Walkthrough

Changes

Estimated code review effort

Possibly related PRs

Poem

❌ Failed checks (1 warning)

Uh oh!

codecov bot commented Mar 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

jan-janssen commented Mar 27, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Copilot AI Mar 27, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Copilot AI Mar 27, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Mar 27, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Copilot AI Mar 27, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Mar 27, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

coderabbitai bot Mar 27, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

dgehringer commented Mar 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

jan-janssen commented Mar 27, 2026 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Mar 27, 2026 •

edited

Loading

codecov bot commented Mar 27, 2026 •

edited

Loading