Background
Understanding Bisect module in Python
- bisect_left(list, element , low, high)
- list - a sorted list where we want to find the element to be inserted
- element - The element to be inserted
- low - the starting index of the list from where we want to start the search
- high - the ending index of the list from where we want to end the search
1 2 3 4 5 6 7 | import bisect list = [ 1 , 3 , 5 , 9 , 10 , 15 ] idx = bisect.bisect_left(list, 6 ) print(f "Index to insert 6 is {idx}" ) list.insert(idx, 6 ) print(f "New list {list}" ) |
- insort_left(list, element)
1 2 3 4 5 | import bisect list = [ 1 , 3 , 5 , 9 , 10 , 15 ] bisect.insort_left(list, 6 ) print(f "New list {list}" ) |
1 2 3 4 5 6 7 | import bisect list = [ 1 , 3 , 5 , 5 , 5 , 9 , 10 , 15 ] idx = bisect.bisect_left(list, 5 ) print(f "Left most index to insert 5 is {idx}" ) idx = bisect.bisect_right(list, 5 ) print(f "Right most index to insert 5 is {idx}" ) |
1. Binary Search
1 2 3 4 5 6 7 8 9 10 11 | import bisect def binary_Search(arr, element, low, high): idx = bisect.bisect_left(arr, element, low, high) if idx < len(arr) and arr[idx] == element: return idx return - 1 arr = [ 1 , 3 , 5 , 7 , 12 , 20 ] search_idx = binary_Search(arr, 7 , 0 , len(arr)) print(f "Binary search index for 7 : {search_idx}" ) |
2. Prefix search
1 2 3 4 5 6 7 8 9 10 11 | import bisect def prefix_search(arr, prefix): idx = bisect.bisect_left(arr, prefix) if idx >= len(arr): return None el = arr[idx] return el if el.startswith(prefix) else None arr = [ "apple" , "cat" , "dog" , "elephant" ] print(prefix_search(arr, "ap" )) |
3. Find no of repeating values
1 2 3 4 5 6 7 8 9 10 11 12 | import bisect def count_repeated_no(arr, no): left_idx = bisect.bisect_left(arr, no) right_idx = bisect.bisect_right(arr, no) if left_idx >= len(arr): return - 1 else : return right_idx - left_idx arr = [ 1 , 2 , 5 , 5 , 5 , 5 , 9 , 1 ] print(f "Count of 5 in array {count_repeated_no(arr, 5)}" ) |
Performance
- One because it uses binary search so it's time complexity is O(LogN)
- Secondly, it's precompiled in C so it's faster than if you implement it yourself (See SO question on comparison with direct lookup in list)