# If you'd like to omit non-matching lines from the result; add ';d' to the end of the expression.
sed -E 's/((?:(?:https?|ftp):\\/{2})(?:(?:[0-9a-z_@-]+\.)+(?:[0-9a-z]){2,4})?(?:(?:\\/(?:[~0-9a-z\#\+\%\@\.\\/_-]+))?\\/)*(?=\s|\"))(\\/)?(index\.html?)?/\1\/index.html/gi;t' <<< "there is some text with <a href=\" http://www.Something.com/ \"> but without index.html; then there is another one that is <a href = \"https://www.other.com/more/stuff/index.html\"> which should be unaltered.
Finally does it work for href = \"http://www.com/stuff/\"
http://www.index.com/
http://ex.com/a/b/\"
http://www.example.com add /index.html
http://www.example.com/something do nothing
http://www.example.com/something/ add /index.html
http://www.example.com/something/index.html do nothing
"
Please keep in mind that these code samples are automatically generated and are not guaranteed to work. If you find any syntax errors, feel free to submit a bug report. For a full regex reference for SED, please visit: https://www.gnu.org/software/sed/manual/html_node/The-_0022s_0022-Command.html