387. First Unique Character in a String #16

kazukiii · 2024-06-19T21:58:37Z

問題へのリンク
https://leetcode.com/problems/first-unique-character-in-a-string/description/

README.mdへ頭の中の言語化と記録をしています。

nodchip · 2024-06-20T01:13:12Z

arai60/first-unique-character-in-a-string/README.md

+    - ソートして見ていく
+        - ソートすることにより同じ文字は隣合う
+        - 元のインデックスを保持しておく必要あり
+        - time: O(n lon n), space: O(n log n) -> time, spaceともにソートの分


ソートするときの空間計算量は O(n) ではないですか？ in-place なら O(1) になると思います。

ありがとうございます。空間計算量O(log n)の書き間違いでした。(C++のstd:sort()の標準の実装であるIntrosortはquick sortベースのため、最悪の場合はスタックサイズO(log n)使うという認識です)

ですが、この問題の場合元のインデックスを保持する関係でO(n)になりますね。。

class Solution { public: int firstUniqChar(string s) { vector<Character> s_with_index; for (int i = 0; i < s.size(); i++) { s_with_index.push_back({s[i], i}); } sort(s_with_index.begin(), s_with_index.end(), [](const Character& a, const Character& b) { return a.character < b.character; }); int count = 1; int answer = numeric_limits<int>::max(); for (int i = 0; i < s_with_index.size(); i++) { if (i < s_with_index.size() - 1 && s_with_index[i].character == s_with_index[i + 1].character) { count++; continue; } if (count == 1) { answer = min(answer, s_with_index[i].index); } count = 1; } return answer != numeric_limits<int>::max() ? answer : -1; } private: struct Character { char character; int index; }; };

nodchip · 2024-06-20T01:14:38Z

arai60/first-unique-character-in-a-string/step1.cpp

@@ -0,0 +1,28 @@
+class Solution {


step2 のように、 index を保持しないほうがシンプルに感じます。

nodchip · 2024-06-20T01:15:25Z

arai60/first-unique-character-in-a-string/step1.cpp

+            char_to_count[s[i]].index = i;
+        }
+
+        int answer = INT_MAX;


step2 のように、見つかったら即 return した方がシンプルに感じます。

また、 INT_MAX より numeric_limits::max() を使用したほうが、 C++ 感があると思います。

std::numeric_limits 知りませんでした。これから使っていこうと思います。
https://en.cppreference.com/w/cpp/types/numeric_limits

nodchip · 2024-06-20T01:21:29Z

arai60/first-unique-character-in-a-string/step1.cpp

+        }
+
+        int answer = INT_MAX;
+        for (const auto& [character, count]: char_to_count) {


x64 アーキテクチャーを仮定するのであれば、値が 64 ビット (8 バイト) 以内に収まる場合は、参照にしないほうが良いと思います。理由は、値が 64 ビット (8 バイト) 以内に収まる場合、汎用レジスター 1 本に格納できるためです。参照にした場合、変数にアクセスする際に毎回ポインター経由でアクセスされるようになるため、定数倍遅くなります。
ただ、実際にはコンパイラーが最適化してくれる可能性もあります。生成されるアセンブラーを確認したほうが良いかもしれません。

Effective C++ 第3版
20項　値渡しよりconst参照渡しを使おう
https://www.maruzen-publishing.co.jp/item/b294734.html

あたりに書いてあったと思います。

ありがとうございます、このようなコメント嬉しいです。CPUアーキテクチャについては詳しくないので、時間を取って勉強しようと思います。
現状、汎用レジスターのサイズ = 値渡しか参照渡しかの判断基準と暗記しておきます。(現代のコンピュータでは一般に64bit)

oda · 2024-06-20T08:15:41Z

arai60/first-unique-character-in-a-string/README.md

+        - time: O(n lon n), space: O(n log n) -> time, spaceともにソートの分
+    - 度数分布を取得してループ
+        - これも元のインデックスを保持する必要ありそう
+            - もしかしたらC++のunordered_mapは挿入順を保持する？


あー、unordered_map がどんなものか確認しておいてください。

map は「挿入ではなくて key の」順序が保たれます。

ありがとうございます。HashTableの標準的な実装方法について調べました。

以下、理解した内容です。
HashTableは、標準的にはbucketの配列として実装される。keyのハッシュ値に基づいて各バケットに分配。
なので、iteratorを使って挿入した順に取り出せるという保証は全くない。
ハッシュ衝突の解決には、一般にchain法とopen address法の2種類ある。C++ではchain法で実装されることが多い。(cpythonはopen address法で実装されているようです)
chain法で衝突を解決する場合、bucketの実装にはlinked listを使用する。

一方、mapはbalanced BST(標準的にはred-black tree)を使って実装されており、各ノードにkey, valueを持っており、keyの大小関係によって並び替えられる。
https://en.cppreference.com/w/cpp/container/map

Yoshiki-Iwasa · 2024-06-24T06:06:15Z

arai60/first-unique-character-in-a-string/step3.cpp

+        for (int i = 0; i < s.size(); i++) {
+            if (char_to_count[s[i]] == 1) return i;
+        }
+        return -1;


-1に名前をつけてあげるのも可読性をあげる選択肢かなと思います

const int NOT_FOUND = -1

みたいな
マジックナンバーは極力回避する方がいいのかなと

コメントありがとうございます。マジックナンバーは極力避けた方が良いですね。

Yoshiki-Iwasa · 2024-06-24T06:23:27Z

arai60/first-unique-character-in-a-string/step4_balanced_bst.cpp

+        map<int, char> first_index_to_char;
+        unordered_map<char, int> char_to_first_index;


ここの変数名、ちょっと違和感があります。
まず、変数名は"それが何であるか"を示すものという感覚があります。

"どのような構造か"は型情報から得られるので変数名に露出する必要は無いと思います

first_index_to_charとchar_to_first_indexそれぞれなんのための変数かコードを下まで読まないとわからないので認知不可が高く感じました。

ちょっと完全に意図が汲み取れていないのですが、mapの構造に関連するような x_to_y のような変数名は認知不可が高いということでしょうか？

恐らく変数名に型情報（今回の場合はchar）を入れる必要はない（又はもっと優先度の高い情報がある）という話ではないでしょうか。
多分charは型情報というよりは対象の文字という意味で使われているとは思うのですが。

あ、charの方ですね。おっしゃる通り対象の文字という意味で使っていました。
とはいえ、型名と被ると混乱しますね。気をつけます。

Yoshiki-Iwasa · 2024-06-24T06:24:18Z

arai60/first-unique-character-in-a-string/step4_balanced_bst.cpp

+            char_to_first_index[s[i]] = i;
+        }
+
+        return !first_index_to_char.empty() ? first_index_to_char.begin()->first : -1;


Suggested change

return !first_index_to_char.empty() ? first_index_to_char.begin()->first : -1;

return first_index_to_char.empty() ? -1: first_index_to_char.begin()->first;

のほうがシンプルかなと思いました

kazukiii added 2 commits June 19, 2024 14:55

step1, step2, step3を追加

67f051b

一部修正

3f9bd39

nodchip reviewed Jun 20, 2024

View reviewed changes

oda reviewed Jun 20, 2024

View reviewed changes

step4を追加

50a8420

Yoshiki-Iwasa reviewed Jun 24, 2024

View reviewed changes

kazukiii mentioned this pull request Aug 14, 2024

108. Convert Sorted Array to Binary Search Tree Ryotaro25/leetcode_first60#26

Open

rihib mentioned this pull request Aug 15, 2024

First Unique Character in a String rihib/leetcode#18

Closed

colorbox mentioned this pull request Nov 27, 2024

387. First Unique Character in a String colorbox/leetcode#29

Merged

		map<int, char> first_index_to_char;
		unordered_map<char, int> char_to_first_index;

	return !first_index_to_char.empty() ? first_index_to_char.begin()->first : -1;
	return first_index_to_char.empty() ? -1: first_index_to_char.begin()->first;

387. First Unique Character in a String #16

Are you sure you want to change the base?

387. First Unique Character in a String #16

Uh oh!

Conversation

kazukiii commented Jun 19, 2024

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Yoshiki-Iwasa Jun 24, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

Yoshiki-Iwasa Jun 24, 2024 •

edited

Loading