# If you'd like to omit non-matching lines from the result; add ';d' to the end of the expression.
sed -E 's/<div\b.*?\bdata-id=\"([^"]*)\">([^<]*).*?\\/>([^<]*)<div\b.*?\bdata-id=\"([^"]*)\">([^<]*).*/@['id':'$1','tag':'$2']; , $3@['id':'$4','tag':'$5'];/gm;t' <<< "<div id=\"tarea\" contentEditable=\"true\" class=\"tarea\" autocorrect=\"false\" spellCheck=\"off\">
Hello Says! <div class=\"tag\" data-id=\"1005\">Vedant Terkar</div> , <br />To all <div class=\"tag\" data-id=\"1006\">SO Users</div> :-).
</div>
<!-- This textarea is Usually Hidden -->
<textarea id=\"opc\" rows=\"5\" cols=\"97\">
</textarea>"
Please keep in mind that these code samples are automatically generated and are not guaranteed to work. If you find any syntax errors, feel free to submit a bug report. For a full regex reference for SED, please visit: https://www.gnu.org/software/sed/manual/html_node/The-_0022s_0022-Command.html