For example: you have a text “GO” when u point the camera at the text you should recognise the text and a 3D arrow-mark should be placed in the real world.
how can it be done using CoreMl, Vision and ARKit.
if it is not possible in these frameworks, what are the other Frameworks for it.
Run an ARKit world tracking session.
Pass along the current camera frame to Vision and perform a text recognition request. Recognizing Text in Images might be a good read to get started with text recognition. To learn how to use a combination of ARKit and Vision in your app, you can check out the Tracking and Altering Images developer sample.
When Vision detects the text, you will get its bounding box in the image in 2D.
By performing a raycast in ARKit based on the 2D position, you can determine the 3D coordinate where to place your arrow.